A METHOD FOR TREATING HERPES VIRUSES



FIELD OF THE INVENTION 
The present invention relates to a method for selecting an anti-herpes viral
compound and a method for selectively inhibiting herpes viruses in a human host in need of
such treatment.



BACKGROUND OF THE INVENTION 
The herpesviruses comprise a large family of double-stranded DNA viruses. Eight
of the herpesviruses, herpes simplex virus types 1 and 2 (HSV-1 and HSV-2), varicella
zoster virus (VZV), human cytomegalovirus (HCMV), Epstein-Barr virus (EBV), and
human herpesviruses 6, 7, and 8 (HHV-6, HHV-7, and HHV-8), have been shown to infect
humans. Several of these viruses are important human pathogens.

HSV-1 is estimated to affect 100 million people in the U.S. Primary infection with
HSV-1 usually occurs between the ages of one and four. Cold sores, the visible symptom,
typically appear at a later age, with 20-45% of the population over the age of fifteen
affected (Whitley, Clin. Infect. Dis., 26:541-555, 1998).

Genital herpes (HSV-2) is the second most common sexually transmitted disease,
with approximately 22% of the U.S. population infected with this virus (Fleming 1997).

VZV is the causative agent of chicken pox upon primary infection and can recur in
adults as zoster.

EBV results in approximately two million cases of infectious mononucleosis in the
U.S. each year. It can also cause lymphomas in immunocompromised patients and has been
associated with Burkitt's lymphoma, nasopharyngeal carcinoma, and Hodgkin's disease.

Infection with HCMV often occurs during childhood and is typically asymptomatic
except in immunocompromised patients, where it causes significant morbidity and
mortality.

HHV-6 is the causative agent of roseola and may be associated with multiple
sclerosis and chronic fatigue syndrome. The disease association of HHV-7 is unclear, but it may
be involved in some cases of roseola. HHV-8 has been associated with Kaposi's sarcoma,
body cavity based lymphomas, and multiple myeloma.

These viruses are capable of residing in a latent state within the host. Reactivation
of latent virus occurs in response to environmental stimuli (e.g., UV exposure, stress,
etc.). Infection or recurrence can be life threatening in immunocompromised patients such
as AIDS or transplant patients, in whom HCMV can result in retinitis, pneumonia, and
gastrointestinal disease.

The growing immunocompromised population has created an unmet medical need
for antivirals against herpesviruses because current therapies do not have a sufficiently
broad spectrum against this family of viruses and/or they have limited utility due to toxicity.
The present invention provides a method for selectively inhibiting herpesvirus DNA
polymerase with compounds that have broad spectrum activity. The method offers a
distinct advantage in the treatment of patients in need, particularly immunocompromised
patients at risk of infection or reactivation by many members of the herpesvirus family.



SUMMARY OF THE INVENTION 
The present invention provides a method of selecting compounds that inhibit herpes
viruses comprising:

a) measuring the IC50 of a compound of interest that inhibits a wild type herpes virus,

b) measuring the IC50 of the same compound that inhibits a binding domain mutant herpes
virus which is of the same strain as the wild type herpes virus,

c) comparing the IC50 of step a with the IC50 of step b; and

d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

In the above method, the order of step a and step b is interchangeable.

The present invention further provides a method of selecting compounds that inhibit
herpes viruses comprising:

a) measuring the IC50 of a compound of interest that inhibits a wild type HSV-1,

b) measuring the IC50 of the same compound that inhibits a binding domain mutant HSV-1
which is of the same strain as the wild type herpes virus,

c) comparing the IC50 of step a with the IC50 of step b; and

d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

In the above method, the order of step a and step b is interchangeable.

The present invention further provides a method of selecting compounds that inhibit
herpes viruses comprising:

a) measuring the IC50 of a compound of interest that inhibits a wild type HSV-2,

b) measuring the IC50 of the same compound that inhibits a binding domain mutant HSV-2
which is of the same strain as the wild type herpes virus,

c) comparing the IC50 of step a with the IC50 of step b; and

d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

In the above method, the order of step a and step b is interchangeable.

The present invention further provides a method of selecting compounds that inhibit
herpes viruses comprising:

a) measuring the IC50 of a compound of interest that inhibits a wild type HCMV,

b) measuring the IC50 of the same compound that inhibits a binding domain mutant
HCMV which is of the same strain as the wild type herpes virus,

c) comparing the IC50 of step a with the IC50 of step b; and

d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

In the above method, the order of step a and step b is interchangeable.
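
The selection criterion of steps a through d reduces to a ratio test on two IC50 values. The following is a minimal sketch of that test, not a prescribed implementation; the function names and the example IC50 values are hypothetical and are chosen only to illustrate the 3-fold threshold stated above.

```python
def fold_change(ic50_wild_type: float, ic50_mutant: float) -> float:
    """Ratio of the mutant IC50 to the wild type IC50 (same units, same parent strain)."""
    if ic50_wild_type <= 0 or ic50_mutant <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_mutant / ic50_wild_type


def selects_compound(ic50_wild_type: float, ic50_mutant: float,
                     threshold: float = 3.0) -> bool:
    """Step d: keep the compound when the IC50 against the binding domain
    mutant is at least `threshold` times the IC50 against the wild type."""
    return fold_change(ic50_wild_type, ic50_mutant) >= threshold


# Hypothetical IC50 values in uM, for illustration only.
print(selects_compound(2.0, 20.0))  # 10-fold shift
print(selects_compound(2.0, 4.0))   # 2-fold shift
```

Because the test depends only on the ratio, it does not matter which of the two IC50 values is measured first, which is consistent with steps a and b being interchangeable.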

The present invention further provides a method for selectively treating diseases 
caused by herpes viruses in a human host comprising administering a compound to a human 
in need of such treatment wherein said compound inhibits herpes viruses by interaction with 
the binding domain in the viral DNA polymerase. 

The present invention further provides a method for selectively inhibiting herpes
viruses in a human host comprising administering a compound to a human in need of such
treatment wherein the IC50 of the compound that inhibits a binding domain mutant herpes virus
is at least 3 times greater than the IC50 of the compound that inhibits a wild type herpes virus
which is of the same strain as the mutant herpes virus.

The present invention further provides a compound for treating herpesviral
infections in a human host wherein the IC50 of the compound that inhibits a binding domain
mutant herpes virus is at least 5 times greater than the IC50 of the compound that inhibits a wild
type herpes virus which is of the same strain as the mutant herpes virus.

The present invention further provides a compound for treating herpesviral
infections in a human host wherein said compound inhibits the herpesvirus by interacting
with the binding domain in the viral DNA polymerase.

The present invention further provides a compound for inhibiting herpesvirus
DNA polymerases wherein serial passage of a wild type herpes virus in the presence of said
compound results in a change of the wild type HSV-1 polymerase at amino acid 823 from
valine to alanine.

The present invention further provides a compound for inhibiting herpesvirus DNA
polymerases wherein serial passage of a wild type herpes virus in the presence of said
compound results in a change of the wild type HCMV polymerase at amino acid 823 from
valine to alanine and at amino acid 824 from valine to leucine.

The present invention further provides a mutant herpesvirus DNA molecule having
a nucleotide sequence selected from a group consisting of SEQ.ID.NO. 1; SEQ.ID.NO. 3;
SEQ.ID.NO. 5; SEQ.ID.NO. 7; SEQ.ID.NO. 9; and SEQ.ID.NO. 11.

The present invention further provides a mutant herpesvirus polymerase amino acid
molecule having an amino acid sequence selected from a group consisting of SEQ.ID.NO.
2; SEQ.ID.NO. 4; SEQ.ID.NO. 6; SEQ.ID.NO. 8; SEQ.ID.NO. 10 and SEQ.ID.NO. 12.



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 - Examples of 4-oxo-DHQ and 4-oxo-DHTP compounds.

Figure 2 - Conserved amino acid region of herpesvirus polymerases.

Figure 3 - Virus recovered after serial passage of HSV-1 in the presence of 20 uM of
compound No. 17.

Figure 4 - Comparison of wild type HSV-1 and HSV-2 DNA polymerase
amino acid sequences aligned by amino acid homology. (Seq. No: 14-19)

Figure 5 - Mutant herpes virus DNA and amino acid sequence list. (Seq. No: 1-12)

Figure 6 - Wild type HCMV DNA polymerase amino acid sequence. (Seq. No: 13)

DETAILED DESCRIPTION OF THE INVENTION

A key enzyme in the replication of all herpesviruses is the virus-coded DNA
polymerase. Most of the currently available anti-herpes drugs target the viral DNA
polymerase. Drugs such as Foscarnet act by direct inhibition of the viral polymerase. These
drugs are non-nucleoside inhibitors of herpesvirus DNA polymerases. Others, such as the
nucleoside analogs Acyclovir, Penciclovir and Ganciclovir, must first be phosphorylated to
the monophosphate form by virus-encoded kinases and further phosphorylated to the
triphosphate by cellular enzymes before they are active inhibitors. The triphosphate forms
of these nucleoside analogs inhibit polymerases by competing with the binding of natural
triphosphates and their subsequent insertion into growing DNA strands. These drugs are
known as nucleoside inhibitors of herpesvirus DNA polymerases.

One of the limitations of the currently available drugs is that they are active against
only a few of the eight human herpesviruses. For example, Acyclovir and Penciclovir
inhibit HSV and VZV replication but have poor activity against CMV.

In order to identify antiviral compounds that would have the potential to inhibit
replication of most of the human herpesviruses, compounds are screened in vitro for
inhibition of herpesvirus DNA polymerase activity. Because portions of the amino acid
sequence of the polymerases are highly conserved within the herpesvirus family, it is
possible to discover small molecules that inhibit herpesvirus polymerases but not cellular
DNA polymerases. Using this biochemical approach, several new classes of compounds
such as the 4-hydroxyquinoline derivatives (4-HQ), 4-oxo-dihydroquinoline derivatives
(4-oxo-DHQ) and 4-oxo-dihydrothienopyridine derivatives (4-oxo-DHTP) were discovered as
potent, non-nucleoside herpesvirus DNA polymerase inhibitors. In vitro polymerase assays
and/or cell culture assays have demonstrated that these compounds inhibit HSV-1,
HSV-2, HCMV, VZV, EBV, and HHV-8 replication.



4-Oxo-DHQ and 4-oxo-DHTP are derivatives of formula I

[Chemical structure of formula I]

wherein ring A is a saturated or unsaturated fused double or triple heterocyclic ring having
1, 2, 3 or 4 heteroatoms selected from the group consisting of oxygen, sulfur, or nitrogen; and
wherein R and X are the appropriate substituents, respectively.

Examples of 4-HQ compounds, 4-oxo-DHQ compounds and 4-oxo-DHTP
compounds are illustrated in Figure 1.

Antiviral activity of these examples is shown in Table 1 below. As shown in Table
1, these compounds inhibit HSV-1 and HSV-2 as well as or better than the current
commercially available drug Acyclovir.
Table 1

Antiviral Activity of 4-oxo-DHQ/4-oxo-DHTP Against HSV-1 and HSV-2

                          Compound IC50 (uM)
Virus            1      2      3      4      5      ACV
HSV-1 KOS        2.0    3.8    3.2    3.2    3.3    3.6
HSV-1 F          2.5    2.3    2.2    2.1    2.6    1.3
HSV-1 DJL        2.5    2.6    1.8    2.2    2.7    1.8
HSV-1 Patton     ND     5.3    7.7    4.3    10     9.3
HSV-2 MS         2.0    2.5    2.8    2.5    2.5    10
HSV-2 35D        ND     5.4    5.0    3.2    8.1    6.0
HSV-2 186        2.0    2.3    3.2    2.3    4.2    >10



It has also been discovered that point mutations within the HSV-1 polymerase gene
that confer resistance to Acyclovir and other nucleoside analogs do not result in resistance
to the 4-HQ, 4-oxo-DHQs or 4-oxo-DHTPs. Serial passage of wild type HSV-1 in the
presence of 4-oxo-DHQ results in the isolation of mutants that are highly resistant (>20 fold
increase in the IC50) to these compounds while retaining sensitivity to nucleoside inhibitors
such as Acyclovir.

In order to determine the mechanism of action of 4-HQ, 4-oxo-DHQ and 4-oxo-DHTP
compounds against herpes viruses, mutants resistant to these compounds are isolated
by serial passage of the virus in the presence of a 4-oxo-DHQ compound. Sequencing
analysis of HSV-1 and HSV-2 strains resistant to the 4-oxo-DHQ identifies that the HSV-1
(KOS strain) polymerase protein and its HSV-2 homolog have a conserved region (a
binding domain), which is a critical contact point for these compounds. While amino acid
numbering of the DNA polymerase may vary between strains of HSV-1 and HSV-2, this
binding domain encompassing the HSV-1 (KOS) strain amino acid 823 is highly conserved
in herpesviruses and can be identified by aligning the homologous amino acids of this
domain as shown in Fig 2.

In HSV-1 and HSV-2 strains resistant to the 4-oxo-DHQ and similar compounds, a 
change of valine to an alanine at the binding domain provides full resistance. 

In the HSV-1 DNA polymerase, resistance is also found when a valine changes to 
methionine at amino acid 823 but only when accompanied by a second amino acid change. 

Isolation of HCMV resistant to 4-oxo-DHQs is found to be very difficult.
Comparison of the amino acid sequence of the HSV polymerase (Y-G-F-T-G-V-Q-H-G)
and HCMV polymerase (Y-G-F-T-G-V-V-N-G) in the region of amino acid 823
(the first valine in each sequence) shows that there is a second valine at position 824 in the HCMV
polymerase. In vitro assay using mutant HCMV polymerases demonstrates that full
resistance to the 4-oxo-DHQs requires changes at both amino acids 823 (a valine to alanine)
and 824 (a valine to leucine). An HCMV polymerase gene containing V823A and V824L
mutations is used in marker rescue experiments to generate a viral mutant. This mutant has
an IC50 approximately 7-fold above that of wild-type HCMV.
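
The comparison of the two nine-residue sequences can be reproduced with a short script. This is an illustrative sketch only: the helper names are hypothetical, and the starting position 818 is an assumption chosen so that the first valine falls at position 823, as the text states for both polymerases.

```python
HSV_MOTIF = "YGFTGVQHG"   # Y-G-F-T-G-V-Q-H-G
HCMV_MOTIF = "YGFTGVVNG"  # Y-G-F-T-G-V-V-N-G
MOTIF_START = 818         # assumption: puts the first valine at position 823


def motif_differences(a: str, b: str, start: int = MOTIF_START):
    """Return (position, residue_in_a, residue_in_b) wherever the motifs differ."""
    if len(a) != len(b):
        raise ValueError("motifs must have the same length")
    return [(start + i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


def valine_positions(motif: str, start: int = MOTIF_START):
    """Return the numbered positions of every valine (V) in the motif."""
    return [start + i for i, residue in enumerate(motif) if residue == "V"]


print(motif_differences(HSV_MOTIF, HCMV_MOTIF))
print(valine_positions(HSV_MOTIF))
print(valine_positions(HCMV_MOTIF))
```

Under these assumptions the two sequences differ only at positions 824 and 825, and HCMV carries valines at both 823 and 824, which matches the observation that two changes are needed for full resistance in HCMV.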

The HSV-1, HSV-2 and HCMV mutants are also found to be resistant to other
non-nucleoside inhibitors such as the 4-oxo-DHTP and similar compounds. However, when the
binding domain mutants (e.g., HSV-1 V823A, HSV-2-MS V826A, HSV-2-186 V828A, and
HCMV V823A/V824L mutants) are tested in plaque reduction assays against a series of
nucleoside polymerase inhibitors and the non-nucleoside inhibitor Foscarnet,
replication of the mutants is found to be inhibited by all of the currently marketed
anti-herpes polymerase inhibitors tested.

These studies demonstrate that certain non-nucleosides like 4-HQ, 4-oxo-DHQ and
4-oxo-DHTP compounds bind to a different site on the herpes polymerase than the
nucleoside inhibitors and Foscarnet. The valine at the binding domain is conserved in the
DNA polymerases of six of the eight human herpesviruses and several animal
herpesviruses, and appears to play a critical role in the antiviral activity of the 4-HQ,
4-oxo-DHQ and 4-oxo-DHTP compounds. (See Figure 2)

Since mutation at the binding domain negates these non-nucleoside inhibitors'
activities, compounds could be tested against wild type polymerases and the mutant
polymerases to establish the probability of similar binding. We refer to this property of
compounds as interaction with the binding domain. Since compounds that interact with the
binding domain have exhibited broad-spectrum activity against herpesviruses, this
invention provides a method for selecting compounds to treat individuals such as
immunocompromised patients who are afflicted with multiple herpesvirus infections.

Definitions 

The term "wild-type" refers to a gene or gene product which has the characteristics
of that gene or gene product when isolated from a naturally occurring source. A wild-type
gene is that which is most frequently observed in a population and is thus arbitrarily
designated the "normal" or "wild-type" form of the gene.

In contrast, the term "mutant" refers to a gene or gene product which displays
modifications in sequence and/or functional properties (i.e., altered characteristics) when
compared to the wild-type gene or gene product. It is noted that naturally-occurring
mutants can be isolated; these are identified by the fact that they have altered
characteristics when compared to the wild-type gene or gene product.

IC50 refers to the concentration of a drug that inhibits virus growth by 50%.
Wild type HSV-1 and HSV-2 strains are listed in Figure 4.
Wild type HCMV is listed in SEQ. ID. NO. 13.
The term "Iudr" refers to the antiviral drug Iododeoxyuridine.
The term "Bvdu" refers to the antiviral drug Bromovinyldeoxyuridine.
The term "ACV" refers to the antiviral drug Acyclovir.
The term "AraC" refers to the antiviral drug Arabinosylcytidine.
The term "AraT" refers to the antiviral drug Arabinosylthymine.
The term "AraA" refers to the antiviral drug Arabinosyladenine.
The term "GCV" refers to the antiviral drug Ganciclovir.
The term "CDV" refers to the antiviral drug Cidofovir.
The term "PFA" refers to the antiviral drug Foscarnet.

The term "binding domain" refers to a conserved region in herpesvirus DNA
polymerases. The herpesvirus DNA polymerases have seven (7) conserved regions. The
binding domain is within the third conserved region (see Figure 2). When virus is grown in
the presence of an inhibitor that contacts the binding domain, resistance arises through
mutation of at least one amino acid in the binding domain. In general, the binding domain is at amino acid sequence
positions 818-829 of the HSV-1 DNA polymerase or the homologous region in other herpes
virus DNA polymerases (see Figure 2).

The term "a binding domain mutant herpes virus" refers to a herpes virus containing 
a binding domain mutation. 
More specifically, the binding domain in HSV-1 strains KOS, F, DJL and Patton
is at amino acid sequence position 823. The binding domain in the HSV-2 MS strain is at
amino acid sequence position 826. The binding domain in the HSV-2 186 strain is at amino
acid sequence position 828. The binding domain in the HCMV AD169 strain is at amino acid
sequence positions 823-824.
The term "XxxxY" means that, at amino acid sequence position xxx, a single amino
acid X in the wild type is changed to an amino acid Y.

For example, the term "V823A" means that, at amino acid sequence position 823, a
valine found in the wild type is changed to alanine in the mutant strain.

The term "V824L" means that, at amino acid sequence position 824, a valine found in
the wild type is changed to leucine in the mutant strain.

The term "V826A" means that, at amino acid sequence position 826, a valine found in
the wild type is changed to alanine in the mutant strain.

The term "V828A" means that, at amino acid sequence position 828, a valine found in
the wild type is changed to alanine in the mutant strain.
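
The "XxxxY" notation is regular enough to be parsed mechanically. The sketch below is one possible way to do so; the function names are hypothetical, and the nine-residue HCMV segment with an assumed start position of 818 is used only as a small worked example.

```python
import re

# One-letter wild type residue, numeric position, one-letter mutant residue.
_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(term: str):
    """Split a term such as 'V823A' into (wild_type, position, mutant)."""
    match = _MUTATION.match(term)
    if match is None:
        raise ValueError(f"not in XxxxY form: {term!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_mutation(segment: str, segment_start: int, term: str) -> str:
    """Apply a mutation to a segment whose first residue is numbered
    `segment_start`, checking that the wild type residue matches."""
    wild_type, position, mutant = parse_mutation(term)
    index = position - segment_start
    if not 0 <= index < len(segment):
        raise ValueError("position lies outside the segment")
    if segment[index] != wild_type:
        raise ValueError("segment does not carry the stated wild type residue")
    return segment[:index] + mutant + segment[index + 1:]


# Assumed numbering: segment starts at 818 so that the first valine is 823.
hcmv = "YGFTGVVNG"
double_mutant = apply_mutation(apply_mutation(hcmv, 818, "V823A"), 818, "V824L")
print(parse_mutation("V823A"), double_mutant)
```

The wild type check in `apply_mutation` is a deliberate safeguard: because numbering differs between strains (823, 826 or 828 for the same valine), a mismatch signals that the wrong strain numbering was supplied.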

A table of amino acids and their representative abbreviations, symbols and codons is 
set forth below in the following Table. 



Amino acid       Abbrev.   Symbol   Codon(s)
Alanine          Ala       A        GCA GCC GCG GCU
Cysteine         Cys       C        UGC UGU
Aspartic acid    Asp       D        GAC GAU
Glutamic acid    Glu       E        GAA GAG
Phenylalanine    Phe       F        UUC UUU
Glycine          Gly       G        GGA GGC GGG GGU
Histidine        His       H        CAC CAU
Isoleucine       Ile       I        AUA AUC AUU
Lysine           Lys       K        AAA AAG
Leucine          Leu       L        UUA UUG CUA CUC CUG CUU
Methionine       Met       M        AUG
Asparagine       Asn       N        AAC AAU
Proline          Pro       P        CCA CCC CCG CCU
Glutamine        Gln       Q        CAA CAG
Arginine         Arg       R        AGA AGG CGA CGC CGG CGU
Serine           Ser       S        AGC AGU UCA UCC UCG UCU
Threonine        Thr       T        ACA ACC ACG ACU
Valine           Val       V        GUA GUC GUG GUU
Tryptophan       Trp       W        UGG
Tyrosine         Tyr       Y        UAC UAU



MATERIALS AND METHODS 

Cell and Viruses 

African green monkey kidney cells (Vero), human foreskin fibroblast cells
(HFF) and herpes viruses can be obtained from the American Type Culture Collection
(ATCC). Media is defined as Dulbecco's modified Eagle media (DMEM) containing 10%
fetal bovine serum (FBS) and supplemented with antibiotics. Cells are maintained in media
at 37°C in a humidified atmosphere of 5% CO2. HSV-1 strains F, Patton and DJL, HSV-2
strains MS, 35D and 186, and HCMV strain AD169 are used in these studies. Strain DJL is
a clinical isolate of HSV-1 isolated in our lab from a primary oral lesion.

Measuring IC50 of a Compound of Interest That Inhibits Herpes Viruses

Preparation of Virus Stocks: HSV-1 and HSV-2 stocks are grown in Vero cells.
HCMV stocks are grown in HFF cells. Approximately 1 ml of media containing sufficient
virus to infect approximately 0.1% to 1% of the cells (multiplicity of infection of 0.001 to
0.01 PFU/cell) is added to a T-150 cell culture flask containing a confluent monolayer of
cells. The cells are incubated at 37°C for approximately 1 hour. Approximately 50 ml of
media is then added to the flask and the cells are incubated at 37°C until viral cytopathic
effect (cpe) is apparent in 100% of the cells. The flask is then placed at -80°C for at least
30 min. The flask containing frozen media and cells is placed in a 37°C water bath until the
media is thawed. This process disrupts the cells and releases virus into the media. 1 ml
aliquots of media containing virus are dispensed into tubes and stored at -80°C. These
aliquots of media containing virus are referred to as virus stocks.

Titrating Virus Stocks: Aliquots of virus are thawed at 37°C and serially diluted (10-fold
dilutions) in media. 0.1 ml of each dilution of virus is placed in a single well of a 24-well
cell culture dish containing a confluent monolayer of cells (Vero cells for HSV-1 and
HSV-2, HFF cells for HCMV) and incubated at 37°C for 1 h. The virus inoculum is then
removed and 1 ml of media containing 0.8% carboxymethylcellulose (CMC) is added to
each well of the dish. The dish is incubated at 37°C for approximately 2-3 days (HSV-1
and HSV-2) or 6-9 days (HCMV) to allow sufficient growth of virus to form plaques in the
cell monolayer. Plaques can be observed and counted microscopically or by staining the
cells with 0.1% crystal violet in 20% ethanol. The virus titer, which is expressed as plaque
forming units (PFU) per ml, is obtained by counting the plaques in a well and correcting for
the dilution of the viral inoculum.

Plaque Reduction Assays: Antiviral activity of compounds against herpesviruses such as
HSV-1, HSV-2, or HCMV can be measured using plaque reduction assays. 0.1 ml of media
containing approximately 50 PFU of virus is added to each well of a 24-well cell culture
dish containing a confluent monolayer of cells (Vero cells for HSV-1 and HSV-2, HFF cells
for HCMV). Compounds are dissolved in 100% DMSO and diluted in 100% DMSO as
200x stocks of the desired final drug concentration. Typically 5-6 two-fold dilutions are
prepared for each compound. Dilutions of compounds are then added to media containing
0.8% CMC, resulting in a final 1x drug concentration. After the virus-infected cells have
incubated for 1 h at 37°C, the virus inoculum is removed and 1 ml of media containing
0.8% CMC and the various concentrations of compound is added to each well of the dish.
The dish is incubated at 37°C for approximately 2-3 days (HSV-1 and HSV-2) or 6-9 days
(HCMV) to allow sufficient growth of virus to form plaques in the cell monolayer. Plaques
can be observed and counted microscopically or by staining the cells with 0.1% crystal
violet in 20% ethanol. Virus inhibition is determined for each drug concentration by
comparing the number of plaques in drug-containing wells to control wells that did not
contain drug. Antiviral activity of a compound is expressed as the concentration of
compound predicted to reduce the number of plaques in a well by 50% (IC50). The IC50
values are calculated by plotting the percent inhibition vs. concentration of compound
using EXCEL software for linear regression.

Selection of 4-oxo-DHQ Resistant HSV-1 and HSV-2

Vero cells are plated out at a density of 3.5 x 10^5 cells per well in a six-well tissue
culture plate. Cells are infected with HSV-1 KOS at a multiplicity of infection (moi) of
0.1 pfu/cell and 1 h post infection the cells are overlaid with 3 ml of media containing 20
uM of a 4-oxo-DHQ. Cultures are incubated for 20 h at 37°C, freeze/thawed to release
cell-associated virus, and 0.1 ml of culture is used to infect a new monolayer of Vero cells
(one passage). Serial passage is repeated seven times in the presence of 20 uM drug. Virus
isolates are then plaque purified three times prior to preparation of stocks. Virus recovered
from each passage in the presence of compound No. 17 is shown in Figure 3. 4-oxo-DHQ
resistant HSV-1 and HSV-2 may also be selected by the marker transfer method described
below using wild-type HSV DNA and the corresponding mutant HSV polymerase gene.

Marker Transfer of a HCMV Mutation 

A plasmid containing the wild-type HCMV polymerase gene is modified to contain
the V823A or V823A and V824L mutations using a site-directed mutagenesis kit
(Stratagene Corp.) and following the manufacturer's protocol. HFF cells are plated into
T25 tissue culture flasks to achieve 80% confluency at the time of the transfection. Wild
type HCMV AD169 DNA and plasmid DNA containing the mutant HCMV polymerase
gene are mixed at a ratio of 1:2 (2 ug of viral DNA to 4 ug of plasmid DNA). DNAs are
transfected using SuperFect transfection reagent according to methods recommended by the
manufacturer (Qiagen Inc.). Cells are harvested five days posttransfection, freeze-thawed
to release virus and half of the sample is used to infect HFF cell monolayers. Cells are
overlaid with media containing 20 uM 4-oxo-DHQ compound 2 (see Figure 1). Serial
passage is repeated seven times in the presence of 20 uM compound 2 and virus isolates are
then plaque purified three times prior to preparation of viral stock.



Isolation of HSV and HCMV viral DNA

HSV DNA is purified from the cytoplasm of infected Vero cells. Vero cells (50%
confluent) are infected at a multiplicity of 0.01 PFU/cell. At 3-5 days postinfection,
infected cells (100% cpe) are harvested by centrifugation at 1000 rpm in a Beckman GS-6R
centrifuge. The pelleted cells are resuspended in TE buffer and placed on ice for 15
minutes. NP-40 is then added to a final concentration of 0.2% and the cells are incubated on ice for a
further 15 minutes. The cells are centrifuged at 2000 rpm for 10 minutes in a Beckman
GS-6R centrifuge. The supernatant is removed and EDTA is added to a final concentration
of 20 mM, followed by the addition of SDS to a final concentration of 0.3% and proteinase
K to a concentration of 50 ug/ml; the mixture is then incubated at 45°C for 2 hours. HCMV DNA is isolated
by infecting HFF cells (25% confluency) with HCMV at a multiplicity of 0.1 PFU/cell.
Cells and media are harvested 5-7 days postinfection (100% cpe) and subjected to low
speed centrifugation to remove intact cells and cell debris, followed by a high speed spin to
pellet virus particles (2500 rpm in a Beckman SW28 rotor for 1 hour). Following
incubation of the HSV and HCMV samples, 1.5 volumes of saturated NaI is added to the
digested extract and the refractive index is adjusted to 1.434-1.435. Ethidium bromide is
added to a final concentration of 50 ug/ml. The samples are loaded into a VTi 50 centrifuge
tube and spun for 24 hours at 45,000 rpm. The DNA band is harvested, extracted three times
with n-butanol, then dialyzed against TE buffer followed by a dialysis against 95% ethanol
and a final dialysis against TE buffer.

DNA Sequencing

HSV-1, HSV-2 or HCMV viral DNAs are sequenced directly using an ABI377
fluorescence sequencer (Perkin Elmer Applied Biosystems, Foster City, CA) and the ABI
BigDye PRISM(TM) dRhodamine Terminator Cycle Sequencing Ready Reaction Kit with
AmpliTaq FS(TM) DNA polymerase (PE Applied Biosystems). Each cycle sequencing
reaction contains about 1.0 ug of purified viral DNA. Cycle sequencing is performed
using an initial denaturation at 98°C for 1 min, followed by 50 cycles: 98°C for 30 sec,
annealing at 50°C for 30 sec, and extension at 60°C for 4 min. Temperature cycles and
times are controlled by a Perkin-Elmer 9700 thermocycler. Extension products are
purified using Centriflex(TM) gel filtration cartridges (Edge BioSystems, Gaithersburg, MD).
Each reaction product is loaded by pipette onto the column, which is then centrifuged in a
swinging bucket centrifuge (Sorvall model RT6000B table top centrifuge) at 750 x g for 1.5
min at room temperature. Column-purified samples are dried under vacuum for about 40
min and then dissolved in 4 ul of a DNA loading solution (83% deionized formamide, 8.3
mM EDTA, and 1.6 mg/ml Blue Dextran). The samples are then heated to 90°C for two
min, and held at 4°C until loading. 1.5 ul of each sample is loaded into a single well of the
ABI377 sequencer. Sequence chromatogram data files from the ABI377 are analyzed with
the computer program Sequencher (Gene Codes, Ann Arbor, MI), for assembly of sequence
fragments and correction of ambiguous base calls. Generally sequence reads of 600-700 bp
are obtained. Potential sequencing errors are minimized by obtaining sequence
information from both DNA strands and by re-sequencing difficult areas using primers at
different locations until all sequencing ambiguities are removed.

The entire coding region of the polymerase genes from both the parent strains and
the resistant viruses is sequenced. The DNA sequencing is done using viral DNA as the
template, thus avoiding cloning of the polymerase genes. The amino acid sequences of the
DNA polymerases of HSV-1 KOS, F, Patton and DJL and HSV-2 MS and 186 are
compared in Figure 4. Amino acids that are identical for the six polymerases are shaded in
black while regions where amino acid differences are found are shaded in gray. The amino
acid sequences of the four HSV-1 polymerases are essentially identical, with only a few
minor changes noted between the different HSV-1 strains. The majority of amino acid
changes are found when the sequences of the HSV-1 and HSV-2 polymerases are
compared.

Isolation and Characterization of HSV-1 and HSV-2 Mutants That Are Resistant To
the 4-oxo-DHQs and 4-oxo-DHTP Compounds

A panel of viruses consisting of four strains of HSV-1 (KOS, F, DJL, Patton) and
three strains of HSV-2 (MS, 35D, 186) is tested in a plaque reduction assay against four
different 4-oxo-DHQ compounds (# 1, 2, 4, 5 as shown in Figure 1), one 4-oxo-DHTP
compound (# 3 as shown in Figure 1) and Acyclovir. The six drugs inhibited
replication of the seven virus strains with IC50 values ranging from 2-10 uM (Table 1). In
order to select for 4-oxo-DHQ resistant mutants, HSV-1 strains KOS, F, and DJL along
with HSV-2 strains 186 and MS are serially passaged in the presence of 20 uM compound
1. Following the seventh passage, 4-oxo-DHQ resistant virus from each strain is plaque
purified three times and high-titer stocks are made. All of the resistant HSV mutants grew
to high titers in Vero cells, indicating that the mutations in the resistant isolates did not
significantly impair their growth. The mutants selected with 4-oxo-DHQ compound 1
exhibited a >10 fold increase in IC50 when tested in a plaque reduction assay against
4-oxo-DHQ compound 1. Data are shown in Table 2.



Table 2

4-oxo-DHQ Resistant Virus of HSV-1 and HSV-2

Virus Mutant      Compound 1 IC50 (µM)    Amino Acid Change in HSV DNA Polymerase
HSV-1 KOS-M1      >20                     V823A
HSV-1 F-M1        >20                     V823A
HSV-1 DJL-M1      >20                     V823A
HSV-2 MS-M1       >20                     V826A
HSV-2 186-M1      >20                     V828A

*HSV-1 and HSV-2 isolates grown in the presence of 4-oxo-DHQ select for resistant virus.

DNA sequence analysis of the 4-oxo-DHQ resistant mutants (HSV-1 KOS-M1,
HSV-1 F-M1, HSV-1 DJL-M1, HSV-2 186-M1, HSV-2 MS-M1) demonstrated that all five
mutants contained a single point mutation of T to C at the binding domain resulting in a
valine to alanine amino acid change.
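
Why a single T to C change is enough to turn valine into alanine follows from the standard genetic code, in which all valine codons are GTN and all alanine codons are GCN. The sketch below enumerates this; it uses only the eight relevant codons, the function names are hypothetical, and the actual codon present in the viral genes is not stated in the text, so all four valine codons are considered.

```python
# Subset of the standard genetic code: valine (V) and alanine (A) codons only.
CODON_TO_AA = {
    "GTT": "V", "GTC": "V", "GTA": "V", "GTG": "V",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
}


def substitute(codon: str, position: int, base: str) -> str:
    """Return the codon with the base at a 0-based position replaced."""
    return codon[:position] + base + codon[position + 1:]


def val_to_ala_by_t_to_c() -> dict[str, tuple[int, str]]:
    """Map each valine codon to (1-based position, mutant codon) for the
    single T-to-C substitution that yields an alanine codon."""
    result = {}
    for codon, amino_acid in CODON_TO_AA.items():
        if amino_acid != "V":
            continue
        for position, base in enumerate(codon):
            if base != "T":
                continue
            mutant = substitute(codon, position, "C")
            if CODON_TO_AA.get(mutant) == "A":
                result[codon] = (position + 1, mutant)
    return result


if __name__ == "__main__":
    for codon, (position, mutant) in val_to_ala_by_t_to_c().items():
        print(f"{codon} -> {mutant} (T to C at codon position {position})")
```

In every case the change falls at the second codon position, which is consistent with one point mutation producing the valine to alanine substitution reported for all five mutants.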

Isolation and Characterization of an HCMV Mutant That Is Resistant to the 4-oxo-DHQ's
and 4-oxo-DHTP Compounds

In order to select for a 4-oxo-DHQ resistant HCMV mutant, virus (strain AD169) is
serially passaged in the presence of 20 µM of a 4-oxo-DHQ. Although we could readily select
for HSV mutants using this procedure, we failed to isolate an HCMV mutant, even when the
virus is passaged at low drug concentrations (<5 µM). Comparison of the amino acid
sequence of the HSV polymerase, Y-G-F-T-G-V-Q-H-G, and HCMV polymerase,
Y-G-F-T-G-V-V-N-G, in the region of amino acid 823 (underlined amino acid) showed that there
is a second valine at position 824 in the HCMV polymerase. In order to determine if both
valines need to be changed in order to confer resistance to the 4-oxo-DHQ's, in vitro
polymerase assays are done using mutant HCMV polymerases containing either V823A or
V823A plus V824L (Table 3).
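
The motif comparison above can be reproduced with a short script. It is a sketch under one stated assumption: the first valine of each nine-residue motif is numbered 823 in both polymerases, as the passage implies. The constant and function names are hypothetical.

```python
HSV_MOTIF = "YGFTGVQHG"    # HSV polymerase motif quoted in the text
HCMV_MOTIF = "YGFTGVVNG"   # HCMV polymerase motif quoted in the text
ANCHOR_INDEX = 5           # 0-based index of the first valine in each motif
ANCHOR_RESIDUE = 823       # assumed residue number of that valine


def residue_number(index: int) -> int:
    """Convert a 0-based motif index to a residue number."""
    return ANCHOR_RESIDUE + (index - ANCHOR_INDEX)


def valine_positions(motif: str) -> list[int]:
    """Residue numbers of every valine in the motif."""
    return [residue_number(i) for i, aa in enumerate(motif) if aa == "V"]


def numbered_differences(ref: str, other: str) -> list[tuple[int, str, str]]:
    """(residue number, ref residue, other residue) for each mismatch."""
    return [
        (residue_number(i), a, b)
        for i, (a, b) in enumerate(zip(ref, other))
        if a != b
    ]


if __name__ == "__main__":
    print("HSV valines: ", valine_positions(HSV_MOTIF))
    print("HCMV valines:", valine_positions(HCMV_MOTIF))
    print("Differences: ", numbered_differences(HSV_MOTIF, HCMV_MOTIF))
```

Under that numbering the HSV motif carries a single valine at 823, while the HCMV motif carries valines at both 823 and 824, which is the observation that motivates testing the double mutant.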
Table 3

HCMV Mutant Polymerase Exhibits Resistance to 4-oxo-DHQ

Polymerase           Compound 1 IC50 (µM)
HCMV (wild)          4.6
HCMV V823A           17.2^
HCMV V823A/V824L     42.9*

^Generation of the valine to alanine change at amino acid 823 of HCMV results in a 3.5-fold increase in resistance.
*Mutation of amino acid 823 from valine to alanine and amino acid 824 from valine to leucine results in a 9-fold
increase in resistance, relative to wild type.
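
The fold increases quoted for Table 3 are ratios of mutant to wild-type IC50. A minimal sketch of that arithmetic follows, using the rounded values printed in the table; the function name is hypothetical. With these rounded inputs the ratios come out at roughly 3.7 and 9.3, close to the approximately 3.5-fold and 9-fold figures in the footnotes, which may have been computed from unrounded data.

```python
# IC50 values (µM) for compound 1, as printed in Table 3.
TABLE3_IC50_UM = {
    "HCMV (wild)": 4.6,
    "HCMV V823A": 17.2,
    "HCMV V823A/V824L": 42.9,
}


def fold_change(ic50_mutant: float, ic50_wild_type: float) -> float:
    """Ratio of mutant IC50 to wild-type IC50."""
    if ic50_wild_type <= 0:
        raise ValueError("wild-type IC50 must be positive")
    return ic50_mutant / ic50_wild_type


if __name__ == "__main__":
    wild = TABLE3_IC50_UM["HCMV (wild)"]
    for name, ic50 in TABLE3_IC50_UM.items():
        print(f"{name}: {fold_change(ic50, wild):.1f}-fold")
```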

The V823A change alone resulted in a 3.5-fold increase in the IC50 while the polymerase
with the double amino acid change had a nearly 10-fold increase in the IC50. In order to
isolate an HCMV resistant mutant, marker rescue experiments are done. Plasmids
containing the mutant polymerase genes are transfected into HFF cells along with wild type
HCMV AD169 DNA. The resulting virus is then serially passaged in the presence of 20
µM compound 1 (see Figure 1). A 4-oxo-DHQ resistant virus is isolated from marker
rescue studies done with the HCMV polymerase gene containing mutations that result in the
V823A, V824L amino acid changes, but not with the gene containing the V823A change
alone. The mutant selected with compound 1 (HCMV AD169-M1) exhibited a ~7-fold
increase in IC50 for compound 1 when tested in a plaque reduction assay, whereas its
sensitivity to ganciclovir and cidofovir changed by < 2-fold (Table 4).

Table 4

Plaque reduction assay of 4-oxo-DHQ resistant HCMV*

Drug           HCMV AD169 IC50 (µM)    HCMV AD169-M1 IC50 (µM)
Compound 1     0.7                     4.7
Ganciclovir    0.9                     1.0
Cidofovir      0.3                     0.6

*Recombination of wild-type HCMV with a polymerase gene containing the valine to alanine change at amino acid
823 and the valine to leucine change at amino acid 824 allowed for selection of resistant virus with about 7-fold less
sensitivity to compound 1.

^Sensitivity of resistant HCMV virus to Ganciclovir and Cidofovir verifies that the 4-oxo-DHQ's mechanism
for inhibiting the polymerase protein is unique.

The entire coding region of the HCMV polymerase genes from both the parent
strain and the resistant virus is sequenced. The DNA sequencing is again done using viral
DNA as the template, thus avoiding cloning of the polymerase genes. Comparison of the
DNA sequence of the two polymerase genes demonstrated that the resistant mutant
contained two point mutations that resulted in the predicted V823A, V824L amino acid
changes. As with the HSV resistant viruses, these results demonstrate the critical role of the
region encompassing amino acid 823 for inhibition of polymerase activity by these
compounds.



Antiviral Activity of Nucleoside and Non-Nucleoside Polymerase Inhibitors Against
4-oxo-DHQ Resistant Mutants

In order to determine if the 4-HQ binding domain mutations alter the sensitivity of
the HSV-1, HSV-2 and HCMV mutants to both non-nucleoside (4-oxo-DHQ's) and
nucleoside inhibitors (e.g. Acyclovir and ganciclovir), several of the mutants are tested in
plaque reduction assays against a series of non-nucleoside compounds including Foscarnet
(PFA), 4-HQ's, 4-oxo-DHQ's and 4-oxo-DHTP's (Table 5). The mutants are also tested
against a series of nucleoside inhibitors including acyclovir and ganciclovir (Table 5). The
activity of these compounds against the mutants is compared to their activity against the
wild type strains that are used to isolate the HSV and HCMV mutants. When tested against
a number of 4-HQ's, 4-oxo-DHQ's and 4-oxo-DHTP's and other related classes of
compounds, all of the drugs are found to inhibit the wild type virus with IC50 values ranging
from <0.1 µM to 30 µM. When these drugs are tested against the resistant viruses they are
found to have IC50 values 5 to 10 fold higher than against the parent virus. There is little if any
difference in the IC50 values of the nucleoside compounds and the non-nucleoside PFA
between the wild type and mutant HSV-1, HSV-2, and HCMV viruses. These results
demonstrate that the amino acid change in the binding domain (V823A in the HSV-1
polymerase, V826A in the HSV2-MS polymerase, V828A in the HSV2-186 polymerase,
and the V823A/V824L changes in the HCMV polymerase) resulted in resistance to the
4-oxo-DHQ's and 4-oxo-DHTP's, which provides further evidence that these classes of
compounds share an affinity for a region we refer to as the binding domain. In contrast,
these amino acid changes did not alter the sensitivity of these viruses to other classes of
polymerase inhibitors.

Table 5

Antiviral activity of nucleoside and non-nucleoside polymerase inhibitors
against HSV-1, HSV-2, and HCMV Isolates selected for 4-oxo-DHQ resistance*

Plaque Reduction Assay - IC50 (µM)

          HSV-2    HSV-2    HSV-1    HSV-1     HCMV     HCMV
Drug      MS       MS-M1    KOS      KOS-M1    AD169    AD169-M1
6         28.8     >50      24.6     >50       5.1      >16
7         8.8      27.9     6.5      >50       0.3      3.4
8         2.3      >50      5.1      >50       <0.1     1.1
9         0.9      48.7     1.9      >50       <0.1     3.1
10        29.2     >50      15.8     >50       1.1      >16
11        3.0      >50      3.1      >50       0.7      3.9
12        0.4      12.5     1.3      >50       0.2      1.1
13        5.3      >50      5.5      <25       2.7      >16
14        1.6      >50      28.4     >50       0.9      18.4
2         1.3      >50      3.3      >50       0.4      4.0
4         2.1      28.4     4.2      >50       0.6      2.1
3         0.8      >50      4.0      >50       1.5      6.2
15        5.9      >50      >50      >50       0.7      7.7
Iudr      5.0      6.1      1.1      0.8       ND       ND
Bvdu      5.8      5.9      2.1      0.1       ND       ND
ACV       2.4      2.8      3.9      4.4       ND       ND
AraC      0.2      0.1      0.2      0.2       ND       ND
AraT      6.6      3.6      11.6     3.6       ND       ND
AraA      10.6     18.2     26.1     27.2      ND       ND
GCVir     ND       ND       ND       ND        0.8      0.8
CDV       ND       ND       ND       ND        0.4      0.3
PFA       ND       ND       ND       ND        38       <20

*HSV-2 MS, HSV-1 KOS, HCMV AD169: wild type strains
*HSV-2 MS-M1, HSV-1 KOS-M1, HCMV AD169-M1: mutants selected for 4-oxo-DHQ resistance
*ND: Not Done
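
Many Table 5 entries are bounds (for example >50 or <0.1) rather than exact values, so the fold change between wild type and mutant can often only be bounded. The sketch below shows one way to carry those qualifiers through the ratio; it is an illustration under the assumption that ">" and "<" mark assay limits, and the type and function names are hypothetical.

```python
from typing import NamedTuple, Optional


class IC50(NamedTuple):
    qualifier: str  # "" for an exact value, ">" or "<" for a bound
    value: float    # concentration in µM


def parse_ic50(cell: str) -> Optional[IC50]:
    """Parse a table cell such as '2.3', '>50', '<0.1' or 'ND'."""
    text = cell.strip()
    if not text or text.upper() == "ND":
        return None
    if text[0] in "<>":
        return IC50(text[0], float(text[1:]))
    return IC50("", float(text))


def fold_change_bound(wild: str, mutant: str) -> Optional[str]:
    """Mutant/wild-type IC50 ratio, keeping track of bounds.

    Returns e.g. '1.2', '>21.7', '<0.5', 'undetermined', or None if
    either value was not determined.
    """
    wt, mut = parse_ic50(wild), parse_ic50(mutant)
    if wt is None or mut is None:
        return None
    ratio = mut.value / wt.value
    if wt.qualifier == "" and mut.qualifier == "":
        return f"{ratio:.1f}"
    # A larger true mutant value or a smaller true wild-type value can
    # only raise the ratio, so the computed ratio is a lower bound.
    if mut.qualifier in ("", ">") and wt.qualifier in ("", "<"):
        return f">{ratio:.1f}"
    # The mirror case gives an upper bound.
    if mut.qualifier in ("", "<") and wt.qualifier in ("", ">"):
        return f"<{ratio:.1f}"
    return "undetermined"


if __name__ == "__main__":
    print(fold_change_bound("2.3", ">50"))  # compound 8, HSV-2 MS vs MS-M1
    print(fold_change_bound("2.4", "2.8"))  # ACV, HSV-2 MS vs MS-M1
```

Treating the bounded entries this way keeps the comparison between wild type and mutant honest: a ">50" mutant value supports a minimum fold change, not an exact one.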

Antiviral compounds identified by the present invention can conveniently be
administered in a pharmaceutical composition containing the compound in combination
with a suitable excipient, the composition being useful in combating viral infections.
Pharmaceutical compositions containing a compound appropriate for antiviral use are
prepared by methods and contain excipients which are well known in the art. A generally
recognized compendium of such methods and ingredients is Remington's Pharmaceutical
Sciences by E.W. Martin (Mack Publishing Co., 15th Ed., 1975).

Antiviral compounds identified by the present invention and their compositions can
be administered parenterally (for example, by intravenous, intraperitoneal or intramuscular
injection), topically, orally, or rectally, depending on whether the preparation is used to treat
internal or external viral infections.

For oral therapeutic administration, the active compound may be combined with one
or more excipients and used in the form of ingestible tablets, buccal tablets, troches,
capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions and
preparations should contain at least 0.1% of active compound. The percentage of the
compositions and preparations may, of course, be varied and may conveniently be between
about 2 to about 60% of the weight of a given unit dosage form. The amount of active
compound in such therapeutically useful compositions is such that an effective dosage level
will be obtained.

The tablets, troches, pills, capsules, and the like may also contain the following:
binders such as gum tragacanth, acacia, corn starch or gelatin; excipients such as dicalcium
phosphate; a disintegrating agent such as corn starch, potato starch, alginic acid and the
like; a lubricant such as magnesium stearate; and a sweetening agent such as sucrose,
fructose, lactose or aspartame or a flavoring agent such as peppermint, oil of wintergreen,
or cherry flavoring may be added. When the unit dosage form is a capsule, it may contain,
in addition to materials of the above type, a liquid carrier, such as a vegetable oil or a
polyethylene glycol. Various other materials may be present as coatings or to otherwise
modify the physical form of the solid unit dosage form. For instance, tablets, pills, or
capsules may be coated with gelatin, wax, shellac or sugar and the like. A syrup or elixir
may contain the active compound, sucrose or fructose as a sweetening agent, methyl and
propylparabens as preservatives, a dye and flavoring such as cherry or orange flavor. Of
course, any material used in preparing any unit dosage form should be pharmaceutically
acceptable and substantially non-toxic in the amounts employed. In addition, the active
compound may be incorporated into sustained-release preparations and devices.

Antiviral compounds identified by the present invention and their compositions can
also be administered intravenously or intraperitoneally by infusion or injection. Solutions
of the active compound or its salts can be prepared in water, optionally mixed with a
nontoxic surfactant. Dispersions can also be prepared in glycerol, liquid polyethylene
glycols, triacetin, and mixtures thereof and in oils. Under ordinary conditions of storage
and use, these preparations contain a preservative to prevent the growth of microorganisms.

Pharmaceutical dosage forms suitable for injection or infusion can include sterile
aqueous solutions or dispersions or sterile powders comprising the active ingredient which
are adapted for the extemporaneous preparation of sterile injectable or infusible solutions or
dispersions, optionally encapsulated in liposomes. In all cases, the ultimate dosage form
should be sterile, fluid and stable under the conditions of manufacture and storage. The
liquid carrier or vehicle can be a solvent or liquid dispersion medium comprising, for
example, water, ethanol, a polyol (for example, glycerol, propylene glycol, liquid
polyethylene glycols, and the like), vegetable oils, nontoxic glyceryl esters, and suitable
mixtures thereof. The proper fluidity can be maintained, for example, by the formation of
liposomes, by the maintenance of the required particle size in the case of dispersions or by
the use of surfactants. The prevention of the action of microorganisms can be brought
about by various antibacterial and antifungal agents, for example, parabens, chlorobutanol,
phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include
isotonic agents, for example, sugars, buffers or sodium chloride. Prolonged absorption of
the injectable compositions can be brought about by the use in the compositions of agents
delaying absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active compound in
the required amount in the appropriate solvent with various of the other ingredients
enumerated above, as required, followed by filter sterilization. In the case of sterile
powders for the preparation of sterile injectable solutions, the preferred methods of
preparation are vacuum drying and the freeze drying techniques, which yield a powder of
the active ingredient plus any additional desired ingredient present in the previously
sterile-filtered solutions.

For topical administration, the present compounds may be applied in pure form, i.e.,
when they are liquids. However, it will generally be desirable to administer them to the
skin as compositions or formulations, in combination with a dermatologically acceptable
carrier, which may be a solid or a liquid.

Useful solid carriers include finely divided solids such as talc, clay, microcrystalline
cellulose, silica, alumina and the like. Useful liquid carriers include water, alcohols or
glycols or water-alcohol/glycol blends, in which the present compounds can be dissolved or
dispersed at effective levels, optionally with the aid of non-toxic surfactants. Adjuvants
such as fragrances and additional antimicrobial agents can be added to optimize the
properties for a given use. The resultant liquid compositions can be applied from absorbent
pads, used to impregnate bandages and other dressings, or sprayed onto the affected area
using pump-type or aerosol sprayers. Thickeners such as synthetic polymers, fatty acids,
fatty acid salts and esters, fatty alcohols, modified celluloses or modified mineral materials
can also be employed with liquid carriers to form spreadable pastes, gels, ointments, soaps,
and the like, for application directly to the skin of the user.

Examples of useful dermatological compositions which can be used to deliver the
compounds of formula I to the skin are known to the art; for example, see Jacquet et al.
(U.S. Pat. No. 4,608,392), Geria (U.S. Pat. No. 4,992,478), Smith et al. (U.S. Pat.
No. 4,559,157) and Wortzman (U.S. Pat. No. 4,820,508).

Useful dosages of the compounds of formula I can be determined by comparing
their in vitro activity, and in vivo activity in animal models. Methods for the extrapolation
of effective dosages in mice, and other animals, to humans are known to the art; for
example, see U.S. Pat. No. 4,938,949.

The compound is conveniently administered in unit dosage form; for example,
containing 5 to 1000 mg, conveniently 10 to 750 mg, most conveniently, 50 to 500 mg of
active ingredient per unit dosage form. The desired dose may conveniently be presented in
a single dose or as divided doses administered at appropriate intervals, for example, as two,
three, four or more sub-doses per day. The sub-dose itself may be further divided, e.g., into
a number of discrete loosely spaced administrations; such as multiple inhalations from an
insufflator or by application of a plurality of drops into the eye.

For internal infections, the compositions can be administered orally or parenterally
at dose levels, calculated as the free base, of about 0.1 to 300 mg/kg, preferably 1.0 to 30
mg/kg of mammal body weight, and can be used in man in a unit dosage form, administered
one to four times daily in the amount of 1 to 1000 mg per unit dose.

For parenteral administration or for administration as drops, as for eye infections,
the compounds are presented in aqueous solution in a concentration of from about 0.1 to
about 10%, more preferably about 0.1 to about 7%. The solution may contain other
ingredients, such as emulsifiers, antioxidants or buffers.

Generally, the concentration of the compound(s) of formula I in a liquid
composition, such as a lotion, will be from about 0.1-25 wt-%, preferably from about 0.5-10
wt-%. The concentration in a semi-solid or solid composition such as a gel or a powder
will be about 0.1-5 wt-%, preferably about 0.5-2.5 wt-%.

The exact regimen for administration of the compounds and compositions disclosed 
herein will necessarily be dependent upon the needs of the individual subject being treated, 
the type of treatment and, of course, the judgment of the attending practitioner. 






WO 02/06513 PCT/US01/16525 

The antiviral activity of a compound of the invention can be determined using 
pharmacological models which are well known to the art, or using Test A described below. 

The compounds of formula (I) and pharmaceutically acceptable salts thereof are
useful as antiviral agents. Thus, they are useful to combat viral infections in animals,
including man. The compounds are generally active against herpes viruses, and are
particularly useful against the varicella zoster virus, the Epstein-Barr virus, the herpes
simplex virus, the human herpes virus type 8 (HHV-8) and the cytomegalovirus (CMV).

CLAIMS

We claim: 

1. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a wild type herpes virus,
b) measuring IC50 of the same compound that inhibits a binding domain mutant herpes
virus which is the same strain as the wild type herpes virus,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

2. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a binding domain mutant
herpes virus,
b) measuring IC50 of the same compound that inhibits a wild type herpes virus which is
the same strain as the mutant herpes virus,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step a is at least 3 times
greater than the IC50 of step b.

3. The method of claim 1 or 2 wherein the herpes virus is HSV-1, HSV-2, HCMV,
VZV, EBV, or HHV-8.

4. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a wild type HSV-1,
b) measuring IC50 of the same compound that inhibits a binding domain mutant HSV-1
which is the same strain as the wild type herpes virus,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

5. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a binding domain mutant
HSV-1,
b) measuring IC50 of the same compound that inhibits a wild type herpes virus which is
the same strain as the mutant HSV-1,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step a is at least 3 times
greater than the IC50 of step b.

6. The method of claim 4 or 5 wherein HSV-1 is HSV-1 KOS, HSV-1 F, HSV-1 DJL
or HSV-1 Patton.

7. The method of claim 5 or 6 wherein the mutation of a wild type herpes virus to
mutant herpes virus is at amino acid 823 from valine to alanine.

8. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a wild type HSV-2,
b) measuring IC50 of the same compound that inhibits a binding domain mutant HSV-2
which is the same strain as the wild type herpes virus,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

9. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a binding domain mutant
HSV-2,
b) measuring IC50 of the same compound that inhibits a wild type herpes virus which is
the same strain as the mutant HSV-2,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step a is at least 3 times
greater than the IC50 of step b.

10. The method of claim 8 or 9 wherein HSV-2 is HSV-2 MS, HSV-2 35D, or HSV-2
186.

11. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a wild type HCMV,
b) measuring IC50 of the same compound that inhibits a binding domain mutant
HCMV which is the same strain as the wild type herpes virus,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step b is at least 3 times
greater than the IC50 of step a.

12. A method of selecting compounds that inhibit herpes viruses comprising:
a) measuring IC50 of a compound of interest that inhibits a binding domain mutant
HCMV,
b) measuring IC50 of the same compound that inhibits a wild type herpes virus which is
the same strain of the mutant HCMV,
c) comparing IC50 of step a with IC50 of step b; and
d) selecting the compound of interest wherein the IC50 of step a is at least 3 times
greater than the IC50 of step b.

13. The method of claim 8 or 9 wherein HCMV is AD169.

14. The methods of claims 1, 4, 8, or 11 wherein IC50 of step b is at least 5 times greater
than the IC50 of step a.

15. The methods of claims 2, 5, 9, or 12 wherein IC50 of step a is at least 5 times greater
than the IC50 of step b.

16. A use of compounds for manufacturing of medicinals for selectively treating
diseases caused by herpes viruses in a human host comprising administering a
compound to a human in need of such treatment wherein said compound inhibits
herpes viruses by interaction with the binding domain in the viral DNA polymerase.

17. A use of compounds for manufacturing of medicinals for selectively inhibiting
herpes viruses in a human host comprising administering a compound to a human in
need of such treatment wherein IC50 of the compound that inhibits a binding domain
mutant herpes virus is at least 3 times greater than IC50 of the compound that
inhibits a wild type herpes virus which is the same strain as the mutant herpes virus.

18. The use of claim 17 wherein IC50 of the compound that inhibits a binding domain
mutant herpes virus is at least 5 times greater than IC50 of the compound that
inhibits a wild type herpes virus which is the same strain as the mutant herpes
virus.

19. The use of claim 17 wherein the herpes virus is HSV-1, HSV-2, HCMV, VZV, EBV,
or HHV-8.

20. A use of compounds for manufacturing of medicinals for treating herpesviral
infections in a human host wherein IC50 of the compound that inhibits a binding
domain mutant herpes virus is at least 5 times greater than IC50 of the compound
that inhibits a wild type herpes virus which is the same strain as the mutant herpes
virus.

21. A use of compounds for manufacturing of medicinals for treating herpesviral
infections in a human host wherein said compound inhibits the herpesvirus by
interacting with the binding domain in the viral DNA polymerase.

22. The herpesviral infection of claim 20 or 21 which is HSV-1, HSV-2, HCMV, VZV,
EBV, or HHV-8 infection.

23. A compound for inhibiting herpesvirus DNA polymerases wherein passage of
a wild type herpes virus in the presence of said compound results in a change of the
wild type HSV-1 polymerases at amino acid 823 from valine to alanine.

24. A compound for inhibiting herpesvirus DNA polymerases wherein passage of a wild
type herpes virus in the presence of said compound results in a change of the wild
type HCMV polymerases at amino acid 823 from valine to alanine and at amino
acid 824 from valine to leucine.

25. A mutant herpesvirus DNA molecule having a nucleotide sequence selected from a
group consisting of SEQ.ID.NO. 1; SEQ.ID.NO. 3; SEQ.ID.NO. 5; SEQ.ID.NO. 7;
SEQ.ID.NO. 9; and SEQ.ID.NO. 11.

26. A mutant herpesvirus polymerase amino acid molecule having an amino acid 
sequence selected from a group consisting of SEQ.ID.NO. 2; SEQ.ID.NO. 4; 
SEQ.ID.NO. 6; SEQ.ID.NO. 8; SEQ.ID.NO. 10 and SEQ.ID.NO. 12. 
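
The selection criterion recited in the method claims reduces to a threshold on the ratio of mutant to wild-type IC50. The following is a minimal sketch of that comparison, not an implementation of the claimed method as a whole: the function name and the `min_fold` parameter are hypothetical, "at least 3 times greater" is read as mutant IC50 >= 3 x wild-type IC50, and both IC50 values are assumed to come from the same assay and the same parent strain. The example values are the HCMV AD169 and AD169-M1 entries from Table 4.

```python
def is_selected(ic50_wild_type: float, ic50_mutant: float,
                min_fold: float = 3.0) -> bool:
    """True when the binding domain mutant is at least `min_fold` times
    less sensitive to the compound than the matching wild-type strain."""
    if ic50_wild_type <= 0 or ic50_mutant <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_mutant >= min_fold * ic50_wild_type


if __name__ == "__main__":
    # Table 4 values in µM: (wild type AD169, mutant AD169-M1).
    examples = {
        "Compound 1": (0.7, 4.7),
        "Ganciclovir": (0.9, 1.0),
    }
    for drug, (wild, mutant) in examples.items():
        print(drug, is_selected(wild, mutant), is_selected(wild, mutant, 5.0))
```

Passing `min_fold=5.0` corresponds to the stricter 5-fold threshold of the dependent claims.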

4-HQ, 4-oxo-DHQ and 4-oxo-DHTP antiviral compounds

Figure 2. The HSV1 (KOS Strain) DNA Polymerase Amino Acid 823 is
Critical for Resistance to 4-Hydroxyquinolines and Related Compounds

Schematic of HSV1 polymerase illustrating the conserved regions A and I-VI found in class 2
polymerases. Also shown is the amino acid sequence for the highly conserved
herpesvirus domain in region III which surrounds the HSV1 amino acid 823.

Figure 4. Comparison of Wild Type HSV-1 and HSV-2 DNA Polymerase Amino
Acid Sequences Aligned by Amino Acid Homology*

HSV2-MS      MFCAAGGPTS PGGKSAARAA SGFFAPHNPR GATQTAPPPC RRQNFYNPHL -50
HSV2-186     MFCAAGGPAS PGGKSAARAA SGFFAPHNPR GATQTAPPPC RRQNFYNPHL -50
HSV1-KOS     MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGR.GPPPC LRQNFYNPYL -49
HSV1-Patton  MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGR.GPPPC LRQNFYNPYL -49
HSV1-DJL     MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGR.GPPPC LRQNFYNPYL -49
HSV1-F       MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGR.GPPPC LRQNFYNPYL -49

HSV2-MS      AQTGTQPKAP GPAQRHTYYS ECDEFRFIAP RSLDEDAPAE QRTGVHDGRL -100
HSV2-186     AQTGTQPKAP GPAQRHTYYS ECDEFRFIAP RSLDEDAPAE QRTGVHDGRL -100
HSV1-KOS     APVGTQQKPT GPTQRHTYYS ECDEFRFIAP RVLDEDAPPE KRAGVHDGHL -99
HSV1-Patton  APVGTQQKPT GPTQRHTYYS ECDEFRFIAP RVLDEDAPPE KRAGVHDGHL -99
HSV1-DJL     APVGTQQKPT GPTQRHTYYS ECDEFRFIAP RVLDEDAPPE KRAGVHDGHL -99
HSV1-F       APVGTQQKPT GPTQRHTYYS ECDEFRFIAP RVLDEDAPPE KRAGVHDGHL -99

HSV2-MS      RRAPKVYCGG DERDVLRVGP EGFWPRRLRL WGGADHAPKG FDPTVTVFHV -150
HSV2-186     RRAPKVYCGG DERDVLRVGP EGFWPRRLRL WGGADHAPEG FDPTVTVFHV -150
HSV1-KOS     KRAPKVYCGG DERDVLRVGS GGFWPRRSRL WGGVDHAPAG FNPTVTVFHV -149
HSV1-Patton  KRAPKVYCGG DERDVLRVGS GGFWPRRSRL WGGVDHAPAG FNPTVTVFHV -149
HSV1-DJL     KRAPKVYCGG DERDVLRVGS GGFWPRRSRL WGGVDHAPAG FNPTVTVFHV -149
HSV1-F       KRAPKVYCGG DERDVLRVGS GGFWPRRSRL WGGVDHAPAG FNPTVTVFHV -149

HSV2-MS      YDILEHVEHA YSMRAAQLHE RFMDAITPAG TVITLLGLTP EGHRVAVHVY -200
HSV2-186     YDILEHVEHA YSMRAAQLHE RFMDAITPAG TVITLLGLTP EGHRVAVHVY -200
HSV1-KOS     YDILENVEHA YGMRAAQFHA RFMDAITPTG TVITLLGLTP EGHRVAVHVY -199
HSV1-Patton  YDILENVEHA YGMRAAQFHA RFMDAITPTG TVITLLGLTP EGHRVAVHVY -199
HSV1-DJL     YDILENVEHA YGMRAAQFHA RFMDAITPTG TVITLLGLTP EGHRVAVHVY -199
HSV1-F       YDILENVEHA YGMRAAQFHA RFMDAITPTG TVITLLGLTP EGHRVAVHVY -199

HSV2-MS      GTRQYFYMNK AEVDRHLQCR APRDLCERLA AALRESPGAS FRGISADHFE -250
HSV2-186     GTRQYFYMNK AEVDRHLQCR APRDLCERLA AALRESPGAS FRGISADHFE -250
HSV1-KOS     GTRQYFYMNK EEVDRHLQCR APRDLCERMA AALRESPGAS FRGISADHFE -249
HSV1-Patton  GTRQYFYMNK EEVDRHLQCR APRDLCERMA AALRESPGAS FRGISADHFE -249
HSV1-DJL     GTRQYFYMNK EEVDRHLQCR APRDLCERMA AALRESPGAS FRGISADHFE -249
HSV1-F       GTRQYFYMNK EEVDRHLQCR APRDLCERMA AALRESPGAS FRGISADHFE -249

HSV2-MS      AEVVERADVY YYETRPTLYY RVFVRSGRAL AYLCDNFCPA IRKYEGGVDA -300
HSV2-186     AEVVERADVY YYETRPTLYY RVFVRSGRAL AYLCDNFCPA IRKYEGGVDA -300
HSV1-KOS     AEVVERTDVY YYETRPALFY RVYVRSGRVL SYLCDNFCPA IKKYEGGVDA -299
HSV1-Patton  AEVVERTDVY YYETRPALFY RVYVRSGRVL SYLCDNFCPA IKKYEGGVDA -299
HSV1-DJL     AEVVERTDVY YYETRPALFY RVYVRSGRVL SYLCDNFCPA IKKYEGGVDA -299
HSV1-F       AEVVERTDVY YYETRPALFY RVYVRSGRVL SYLCDNFCPA IKKYEGGVDA -299

HSV2-MS      TTRFILDNPG FVTFGWYRLK PGRGNAPAQP RPPTAFGTSS DVEFNCTADN -350
HSV2-186     TTRFILDNPG FVTFGWYRLK PGRGNAPAQP RPPTAFGTSS DVEFNCTADN -350
HSV1-KOS     TTRFILDNPG FVTFGWYRLK PGRNNTLAQP RAPMAFGTSS DVEFNCTADN -349
HSV1-Patton  TTRFILDNPG FVTFGWYRLK PGRNNTLAQP RAPMAFGTSS DVEFNCTADN -349
HSV1-DJL     TTRFILDNPG FVTFGWYRLK PGRNNTLAQP RAPMAFGTSS DVEFNCTADN -349
HSV1-F       TTRFILDNPG FVTFGWYRLK PGRNNTLAQP RAPMAFGTSS DVEFNCTADN -349

HSV2-MS      LAVEGAMCDL PAYKLMCFDI ECKAGGEDEL AFPVAERPED LVIQISCLLY -400
HSV2-186     LAVEGAMCDL PAYKLMCFDI ECKAGGEDEL AFPVAERPED LVIQISCLLY -400
HSV1-KOS     LAIEGGMSDL PAYKLMCFDI ECKAGGEDEL AFPVAGHPED LVIQISCLLY -399
HSV1-Patton  LAIEGGMSDL PAYKLMCFDI ECKAGGEDEL AFPVAGHPED LVIQISCLLY -399
HSV1-DJL     LAIEGGMSDL PAYKLMCFDI ECKAGGEDEL AFPVAGHPED LVIQISCLLY -399
HSV1-F       LAIEGGMSDL PAYKLMCFDI ECKAGGEDEL AFPVAGHPED LVIQISCLLY -399

HSV2-MS      DLSTTALEHI LLFSLGSCDL PESHLSDLAS RGLPAPVVLE FDSEFEMLLA -450
HSV2-186     DLSTTALEHI LLFSLGSCDL PESHLSDLAS RGLPAPVVLE FDSEFEMLLA -450
HSV1-KOS     DLSTTALEHV LLFSLGSCDL PESHLNELAA RGLPTPVVLE FDSEFEMLLA -449
HSV1-Patton  DLSTTALEHV LLFSLGSCDL PESHLNELAA RGLPTPVVLE FDSEFEMLLA -449
HSV1-DJL     DLSTTALEHV LLFSLGSCDL PESHLNELAA RGLPTPVVLE FDSEFEMLLA -449
HSV1-F       DLSTTALEHV LLFSLGSCDL PESHLNELAA RGLPTPVVLE FDSEFEMLLA -449

HSV2-MS      FMTFVKQYGP EFVTGYNIIN FDWPFVLTKL TEIYKVPLDG YGRMNGRGVF -500
HSV2-186     FMTFVKQYGP EFVTGYNIIN FDWPFVLTKL TEIYKVPLDG YGRMNGRGVF -500
HSV1-KOS     FMTLVKQYGP EFVTGYNIIN FDWPFLLAKL TDIYKVPLDG YGRMNGRGVF -499
HSV1-Patton  FMTLVKQYGP EFVTGYNIIN FDWPFLLAKL TDIYKVPLDG YGRMNGRGVF -499
HSV1-DJL     FMTLVKQYGP EFVTGYNIIN FDWPFLLAKL TDIYKVPLDG YGRMNGRGVF -499
HSV1-F       FMTLVKQYGP EFVTGYNIIN FDWPFLLAKL TDIYKVPLDG YGRMNGRGVF -499

HSV2-MS      RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKVKLSSYK LNAVAEAVLK -550
HSV2-186     RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKVKLSSYK LNAVAEAVLK -550
HSV1-KOS     RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKIKLSSYK LNAVAEAVLK -549
HSV1-Patton  RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKIKLSSYK LNAVAEAVLK -549
HSV1-DJL     RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKIKLSSYK LNAVAEAVLK -549
HSV1-F       RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKIKLSSYK LNAVAEAVLK -549

HSV2-MS      DKKKDLSYRD IPAYYASGPA QRGVIGEYCV QDSLLVGQLF FKFLPHLELS -600
HSV2-186     DKKKDLSYRD IPAYYASGPA QRGVIGEYCV QDSLLVGQLF FKFLPHLELS -600
HSV1-KOS     DKKKDLSYRD IPAYYAAGPA QRGVIGEYCI QDSLLVGQLF FKFLPHLELS -599
HSV1-Patton  DKKKDLSYRD IPAYYAAGPA QRGVIGEYCI QDSLLVGQLF FKFLPHLELS -599
HSV1-DJL     DKKKDLSYRD IPTYYAAGPA QRGVIGEYCI QDSLLVGQLF FKFLPHLELS -599
HSV1-F       DKKKDLSYRD IPAYYAAGPA QRGVIGEYCI QDSLLVGQLF FKFLPHLELS -599



HSV2-MS AVARL AG INI TRTIYDGQQI 

HSV2-186 AVARLAGINI TRTIYDGQQI 

HSV-Kos AVARLAGINI TRTIYDGQQI 

25 HSVl-Patton AVARLAGINI TRTIYDGQQI 

HSVl-DJL AVARLAGINI TRTIYDGQQI 

HSV1-F AVARLAGINI TRTIYDGQQI 

HSV2-MS APKRPAVPRG EGERPGDGNG 

30 HSV2-186 APKRPAVPRG EGERPGDGNG 

HSV-Kos APKRPAAARE DEERP 

HSVl-Patton APKRPAAARE DEERP 

HSVl-DJL APKRPAAARE DEERP 

HSV1-F APKRPAAARE DEERP 

35 

HSV2-MS GYQGARVLDP TSGFHVDPW 

HSV2-186 GYQGARVLDP TSGFHVDPW 

HSV-Kos GYQGARVLDP TSGFHVNPW 

HSVl-Patton GYQGARVLDP ISGFHVNPW 

40 HSV1-DJL GYQGARVLDP TSGFHVNPW 

HSV1-F GYQGARVLDP TSGFHVNPW 

HSV2-MS HLEADRDYLE IEVGGRRLFF 

HSV2-186 HLEADRDYLE IEVGGRRLFF 

45 HSV-Kos HLEAGKDYLE IEVGGRRLFF 

HSVl-Patton HLEAGKDYLE IEVGGRRLFF 

HSV1-DJL HLEAGKDYLE IEVGGRRLFF 

HSV1-F HLEAGKDYLE IEVGGRRLFF 

50 HSV2-MS STPEEAVLLD KQQAAI KWC 

HSV2-186 SPPEEAVLLD KQQAAI KWC 

HSV-Kos SSPEEAVLLD KQQAAI KWC 

HSVl-Patton SSPEEAVLLD KQQAAI KWC 

HSV1-DJL SSPEEAVLLD KQQAAI KWC 

55 HSV1-F SSPEEAVLLD KQQAAI KWC 

HSV2-MS LLATRAYVHA RWAEFDQLLA 

HSV2-186 LLATRAYVHA RWAEFDQLLA 

HSV-Kos LLATREYVHA RWAAFEQLLA 

60 HSVl-Patton LLATREYVHA RWAAFEQLLA 

HSV1-DJL LLATREYVHA RWAAFEQLLA 

HSV1-F LLATREYVHA RWAAFEQLLA 

HSV2-MS RGLTAAGLVA MGDKMASHIS 

65 HSV2-186 RGLTAAGLVA MGDKMASHIS 

HSV-Kos RGLTAAGLTA MGDKMASHIS 

HSVl-Patton RGLTAAGLTA MGDKMASHIS 



RVFTC LLRLA GQKGFILPDT QGRFRGLDKE -650 
RVFTCLLRLA GQKGFILPDT QGRFRGLDKE -650 
RVFTCLLRLA DQKGF ILPDT QGRFRGAGGE -649 
RVFTCLLRLA DQKGF ILPDT QGRFRGAGGE -649 
RVFTCLLRLA DQKGF ILPDT QGRFRGAGGE -649 
RVFTCLLRLA DQKGFILPDT QGRFRGGGGE -649 

DEDKDDDE . . DEDGDERE . E VARETGGRHV -697 

DEDKDDDEDG DEDGDERE . E VARETGGRHV -697 

EEEGEDEDER EEGGGEREPE GARETAGRHV -694 

EEEGEDEDER EEGGGEREPE GARETAGRHV -694 

EEEGEDENER EEGGGEREPE GARETAGRHV -694 

EEEGEDEDER EEGGGEREPE GARETAGRHV -694 

VFDFASLYPS IIQAHNLCFS TLSLRPEAVA -7 47 
VFDFASLYPS IIQAHNLCFS TLSLRPEAVA -749 
VFDFASLYPS IIQAHNLCFS TLSLRADAVA -744 
VFDFASLYPS IIQAHNLCFS TLSLRADAVA -744 
VFDFASLYPS IIQAHNLCFS TLSLRADAVA -744 
VFDFASLYPS IIQAHNLCFS TLSLRADAVA -744 

VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -797 
VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -799 
VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -794 
VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -794 
VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -794 
VKAHVRESLL SILLRDWLAM RKQIRSRIPQ -794 

NSVYGFTGVQ HGLLPCLHVA ATVTT I GREM -847 
NSVYGFTGVQ HGLLPCLHVA ATVTTIGREM -849 
NSVYGFTGVQ HGLLPCLHVA ATVTTIGREM -844 
NSVYGFTGVQ HGLLPCLHVA ATVTTIGREM -844 
NSVYGFTGVQ HGLLPCLHVA ATVTTIGREM -844 
NSVYGFTGVQ HGLLPCLHVA ATVTTIGREM -844 

DFPEAAGMRA PGPYSMRIIY GDTDSIFVLC -897 
DFPEAAGMRA PGPYSMRIIY GDTDSIFVLC -899 
DFPEAADMRA PGPYSMRIIY GDTDSIFVLC -894 
DFPEAADMRA PGPYSMRIIY GDTDSIFVLC -894 
DFPEAADMRA PGPYSMRIIY GDTDSIFVLC -894 
DFPEAADMRA PGPYSMRIIY GDTDSIFVLC -894 

RALFLPPIKL ECEKTFTKLL L I AKKKYIGV -947 
RALFLPPIKL ECEKTFTKLL LI AKKKYIGV -949 
RALFLPPIKL ECEKTFTKLL L I AKKKYIGV -944 
RALFLPPIKL ECEKTFTKLL LI AKKKYIGV -944 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



HSV1-DJL 
HSV1-F 

HSV2 -MS 

HSV2-186 

HSV-Kos 

HSV1- Pattern 

HSV1-DJL 

HSVl-F 

HSV2 -MS 

HSV2-186 

HSV-Kos 

HSVl-Patton 

HSV1-DJL 

HSVl-F 

HSV2-MS 

HSV2-186 

HSV-Kos 

HSVl-Patton 

HSV1-DJL 

HSVl-F 

HSV2-MS 

HSV2-18 6 

HSV-Kos 

HSVl-Patton 

HSV1-DJL 

HSVl-F 

HSV2-MS 

HSV2-186 

HSV-Kos 

HSVl-Patton 

HSV1-DJL 

HSVl-F 

HSV2-MS 

HSV2-186 

HSV-Kos 

HSVl-Patton 

HSV1-DJL 

HSVl-F 



RGLTAAGLTA VGDKMAS HIS RALFLPPIKL ECEKTFTKLL LIAKKKYIGV 
RGLTAAGLTA VGDKMASHIS RALFLSPIKL ECEKTFTKLL LIAKKKYIGV 



ICGGKMLIKG 
ICGGKMLIKG 
IYGGKMLIKG 
IYGGKMLIKG 
IYGGKMLIKG 
IYGGKMLIKG 



VDLVRKNNCA 
VDLVRKNNCA 
VDLVRKNNCA 
VDLVRKNNCA 
VDLVRKNNCA 
VDLVRKNNCA 



AEEWLARPLP EGLQAFGAVL 
AEEWLARPLP EGLQAFGAVL 
AEEWLARPLP EGLQAFGAVL 
AEEWLARPLP EGLQAFGAVL 
AEEWLARPLP EGLQAFGAVL 
AEEWLARPLP EGLQAFGAVL 

TNKRLAHLTV YYKLMARRAQ 
TNKRLAHLTV YYKLMARRAQ 
TNKRLAHLTV YYKLMARRAQ 
TNKRLAHLTV YYKLMARRAQ 
TNKRLAHLTV YYKLMARRAQ 
TNKRLAHLTV YYKLMARRAQ 

ELDAAAPGDE PAPPAALPSP 

ELDAAAPGDE PAPPAALPSP 

ELDAAAPGDE PAPPAALPSP 

ELDAAAPGDE PAPPAALPSP 

ELDAAAPGDE PAPPAALPSP 

ELDAAAPGDE PAPPAALPSP 



DPGYAIARGV 
DPGYAIARGV 
DPAYAIAHGV 
DPAYAIAHGV 
DPAYAIAHGV 
DPAYAIAHGV 

TWHP PDDVAA 
TWHPPDDVAA 
VWHPPDDVAA 
VWHPPDDVTA 
VWHPPDDVAA 
VWHPPDDVAA 



PLNTDYYFSH 
PLNTDYYFSH 
ALNTDYYFSH 
ALNTDYYFSH 
ALNTDYYFSH 
ALNTDYYFSH 

RLRAAGFGPA 
RLRAAGFGPA 
RLRAAGFGAV 
RLRAAGFGAV 
RLRTAGFGAV 
RLRAAGFGAV 



FINRTSRALV 
FINRTSRALV 
FINRTSRALV 
FINRTSRALV 
FINRTSRALV 
FINRTSRALV 

VDAHRRITDP 
VDAHRRI TDP 
VDAHRRITDP 
VDAHRRITDP 
VDAHRRITDP 
VDAHRRITDP 

VPSIKDRIPY 
VPSIKDRIPY 
VPSIKDRIPY 
VPSIKDRIPY 
VPSIKDRIPY 
VPSIKDRIPY 

AKRPRETPSH 
AKRPRETPSH 
AKRPRETPSH 
AKRPRETPSP 
AKRPRETPSP 
AKRPRET PLH 

LLGAACVTFK 
LLGAACVTFK 
LLGAACVTFK 
LLGAACVTFK 
LLGAACVTFK 
LLGAACVTFK 

GAGATAEETR 
GAGATAEETR 
GAGATAEETR 
GAGATAEETR 
GAGATAEETR 
GAGATAEETR 



DLLFYDDTVS 
DLLFYDDTVS 
DLLFYDDTVS 
DLLFYDDTVS 
DLLFYDDTVS 
DLLFYDDTVS 

ERDIQDFVLT 
ERDIQDFVLT 
ERDIQDFVLT 
ERDIQDFVLT 
ERDIQDFVLT 
ERDIQDFVLT 

VIVAQTREVE 
VIVAQTREVE 
VIVAQTREVE 
VIVAQTREVE 
VIVAQTREVE 
VIVAQTREVE 

ADPPGGASKP 
ADPPGGASKP 
ADPPGGASKP 
ADPPGGASKP 
ADPPGGASKP 
ADPPGGASKP 

ALFGNNAKIT 
ALFGNNAKIT 
ALFGNNAKIT 
ALFGNNAKIT 
ALFGNNAKIT 
ALFGNNAKIT 



GAAAALAERP -: 

GAAAALAERP 

GAAAALAERP 

GAAAALAERP 

GAAAALAERP 

GAAAALAERP 

AELSRHPRAY 
AEL SRHPRAY 
AELSRHPRAY 
AELSRHPRAY 
AELSRHPRAY 
AELSRHPRAY 

ETVARLAALR 
ETVARLAALR 
ETVARLAALR 
ETVARLAALR 
ETVARLAALR 
ETVARLAALR 

RKLLVSELAE 
RKLLVSELAE 
RKLLVSELAE 
RKLLVSELAE 
RKLLVSELAE 
RKLLVSELAE 

ESLLKRFIPE 
ESLLKRFIPE 
ESLLKRFIPE 
ESLLKRFIPE 
ESLLKRFIPE 
ESLLKRFIPE 



-944 
-944 

-997 
-999 
-994 
-994 
-994 
-994 

-1047 
-1049 
-1044 
-1044 
-1044 
-1044 

-1097 
-1099 
-1094 
-1094 
-1094 
-1094 

-1147 
-1149 
-1144 
-1144 
-1144 
-1144 

-1197 
-1199 
-1194 
-1194 
-1194 
-1194 



RMLHRAFDTL A* -123 8 

RMLHRAFDTL A* -1240 

RMLHRAFDTL A* -123 5 

RMLHRAFDTL A* -123 5 

RMLHRAFDTL A* -1235 

RMLHRAFDTL A* -123 5 



* The amino acid alignment demonstrates differences in the amino acid sequences.

* The gaps "." indicate missing amino acids relative to other strains.

* Wild-type HSV2-MS is listed as SEQ.ID.NO. 14.
* Wild-type HSV2-186 is listed as SEQ.ID.NO. 15.
* Wild-type HSV-Kos is listed as SEQ.ID.NO. 16.
* Wild-type HSV1-Patton is listed as SEQ.ID.NO. 17.
* Wild-type HSV1-DJL is listed as SEQ.ID.NO. 18.
* Wild-type HSV1-F is listed as SEQ.ID.NO. 19.
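
The strain-to-strain comparison summarized in these notes can also be carried out programmatically. The following is a minimal sketch in Python and is not part of the original disclosure. The function name `differing_columns` is an illustrative assumption, as is the use of "." to mark a gap. The two ten-residue fragments are taken from the HSV2-MS and HSV-Kos rows of the alignment. The function reports every alignment column at which the supplied strains do not all carry the same residue.

```python
def differing_columns(aligned):
    """Return (column, {strain: residue}) for every column where strains differ.

    `aligned` maps a strain name to its aligned sequence. All sequences must
    have the same length; "." is assumed to mark a gap. Columns are numbered
    from 1 within the fragment, not by residue position in any one strain.
    """
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("aligned sequences must all have the same length")
    diffs = []
    for col in range(lengths.pop()):
        residues = {name: seq[col] for name, seq in aligned.items()}
        if len(set(residues.values())) > 1:
            diffs.append((col + 1, residues))
    return diffs


# Illustrative usage with one ten-residue fragment of the alignment.
fragment = {"HSV2-MS": "TEIYKVPLDG", "HSV-Kos": "TDIYKVPLDG"}
for column, residues in differing_columns(fragment):
    print(column, residues)
```

Because HSV-1 and HSV-2 rows carry different residue numbers at the same alignment column, the sketch reports column indices rather than residue positions.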



WO 02/06513 PCT/US01/16525

Figure 5 DNA and amino acid sequence list 

SEQ.ID.NO. 1 DNA sequence of DNA polymerase gene for HSV2-MS-M1

1 ATGTTTTGTG CCGCGGGCGG CCCGACTTCC CCCGGGGGGA AGTCGGCGGC
51 TCGGGCGGCG TCTGGGTTTT TTGCCCCCCA CAACCCCCGG GGAGCCACCC
101 AGACGGCACC GCCGCCTTGC CGCCGGCAGA ACTTCTACAA CCCCCACCTC
151 GCTCAGACCG GAACGCAGCC AAAGGCCCCC GGGCCGGCTC AGCGCCATAC
201 GTACTACAGC GAGTGCGACG AATTTCGATT TATCGCCCCG CGTTCGCTGG
251 ACGAGGACGC CCCCGCGGAG CAGCGCACCG GGGTCCACGA CGGCCGCCTC
301 CGGCGCGCCC CTAAGGTGTA CTGCGGGGGG GACGAGCGCG ACGTCCTCCG
351 CGTGGGCCCG GAGGGCTTCT GGCCGCGTCG CTTGCGCCTG TGGGGCGGTG
401 CGGACCATGC CCCCAAGGGG TTCGACCCCA CCGTCACCGT CTTCCACGTG
451 TACGACATCC TGGAGCACGT GGAACACGCG TACAGCATGC GCGCCGCCCA
501 GCTCCACGAG CGATTTATGG ACGCCATCAC GCCCGCCGGG ACCGTCATCA
551 CGCTTCTGGG TCTGACCCCC GAAGGCCATC GCGTCGCCGT TCACGTCTAC
601 GGCACGCGGC AGTACTTTTA CATGAACAAG GCGGAGGTGG ATCGGCACCT
651 GCAGTGCCGT GCCCCGCGCG ATCTCTGCGA GCGCCTGGCG GCGGCCCTGC
701 GCGAGTCGCC GGGGGCGTCG TTCCGCGGCA TCTCCGCGGA CCACTTCGAG
751 GCGGAGGTGG TGGAGCGCGC CGACGTGTAC TATTACGAAA CGCGCCCGAC
801 CCTGTACTAC CGCGTCTTCG TGCGAAGCGG GCGCGCGCTG GCCTACCTGT
851 GCGACAACTT TTGCCCCGCG ATCAGGAAGT ACGAGGGGGG CGTCGACGCC
901 ACCACCCGGT TTATCCTGGA CAACCCGGGG TTTGTCACCT TCGGCTGGTA
951 CCGCCTCAAG CCCGGCCGCG GGAACGCGCC GGCCCAACCG CGCCCCCCGA
1001 CGGCGTTCGG AACCTCGAGC GACGTCGAGT TTAACTGCAC GGCGGACAAC
1051 CTGGCCGTCG AGGGGGCCAT GTGTGACCTG CCGGCCTACA AGCTCATGTG
1101 CTTCGATATC GAATGCAAGG CCGGGGGGGA GGACGAGCTG GCCTTTCCGG
1151 TCGCGGAACG CCCGGAAGAC CTCGTCATCC AGATCTCCTG TCTGCTCTAC
1201 GACCTGTCCA CCACCGCCCT CGAGCACATC CTCCTGTTTT CGCTCGGATC
1251 CTGCGACCTC CCCGAGTCCC ACCTCAGCGA TCTCGCCTCC AGGGGCCTGC
1301 CGGCCCCCGT CGTCCTGGAG TTTGACAGCG AATTCGAGAT GCTGCTGGCC
1351 TTCATGACCT TCGTCAAGCA GTACGGCCCC GAGTTCGTGA CCGGGTACAA
1401 CATCATCAAC TTCGACTGGC CCTTCGTCCT GACCAAGCTG ACGGAGATCT
1451 ACAAGGTCCC GCTCGACGGG TACGGGCGCA TGAACGGCCG GGGTGTGTTC
1501 CGCGTGTGGG ACATCGGCCA GAGCCACTTT CAGAAGCGCA GCAAGATCAA
1551 GGTGAACGGG ATGGTGAACA TCGACATGTA CGGCATCATC ACCGACAAGG
1601 TCAAACTCTC CAGCTACAAG CTGAACGCCG TCGCCGAGGC CGTCTTGAAG
1651 GACAAGAAGA AGGATCTGAG CTACCGCGAC ATCCCCGCCT ACTACGCCTC
1701 CGGGCCCGCG CAGCGCGGGG TGATCGGCGA GTATTGTGTG CAGGACTCGC
1751 TGCTGGTCGG GCAGCTGTTC TTCAAGTTTC TGCCGCACCT GGAGCTTTCC
1801 GCCGTCGCGC GCCTGGCGGG CATCAACATC ACCCGCACCA TCTACGACGG
1851 CCAGCAGATC CGCGTCTTCA CGTGCCTCCT GCGCCTTGCG GGCCAGAAGG
1901 GCTTCATCCT GCCGGACACC CAGGGGCGGT TTCGGGGCCT CGACAAGGAG
1951 GCGCCCAAGC GCCCGGCCGT GCCTCGGGGG GAAGGGGAGC GGCCGGGGGA
2001 CGGGAACGGG GACGAGGATA AGGACGACGA CGAGGACGAG GACGGGGACG
2051 AGCGCGAGGA GGTCGCGCGC GAGACCGGGG GCCGGCACGT TGGGTACCAG
2101 GGGGCCCGGG TCCTCGACCC CACCTCCGGG TTTCACGTCG ACCCCGTGGT
2151 GGTGTTTGAC TTTGCCAGCC TGTACCCCAG CATCATCCAG GCCCACAACC
2201 TGTGCTTCAG TACGCTCTCC CTGCGGCCCG AGGCCGTCGC GCACCTGGAG
2251 GCGGACCGGG ACTACCTGGA GATCGAGGTG GGGGGCCGAC GGCTGTTCTT
2301 CGTGAAGGCC CACGTACGCG AGAGCCTGCT GAGCATCCTG CTGCGCGACT
2351 GGCTGGCCAT GCGAAAGCAG ATCCGCTCGC GGATCCCCCA GAGCACCCCC
2401 GAGGAGGCCG TCCTCCTCGA CAAGCAACAG GCCGCCATCA AGGTGGTGTG
2451 CAACTCGGTG TACGGGTTCA CCGGGGCGCA GCACGGTCTT CTGCCCTGCC
2501 TGCACGTGGC CGCCACCGTG ACGACCATCG GCCGCGAGAT GCTCCTCGCG
2551 ACGCGCGCGT ACGTGCACGC GCGCTGGGCG GAGTTCGATC AGCTGCTGGC
2601 CGACTTTCCG GAGGCGGCCG GCATGCGCGC CCCCGGTCCG TACTCCATGC
2651 GCATCATCTA CGGGGACACG GACTCCATTT TCGTTTTGTG CCGCGGCCTC
2701 ACGGCCGCGG GCCTGGTGGC CATGGGCGAC AAGATGGCGA GCCACATCTC
2751 GCGCGCGCTG TTCCTCCCCC CGATCAAGCT CGAGTGCGAA AAAACGTTCA
2801 CCAAGCTGCT GCTCATCGCC AAGAAAAAGT ACATCGGCGT CATCTGCGGG
2851 GGCAAGATGC TCATCAAGGG CGTGGATCTG GTGCGCAAAA ACAACTGCGC
2901 GTTTATCAAC CGCACCTCCA GGGCCCTGGT CGACCTGCTG TTTTACGACG
2951 ATACCGTATC CGGAGCGGCC GCCGCGTTAG CCGAGCGCCC CGCAGAGGAG
3001 TGGCTGGCGC GACCCCTGCC CGAGGGACTG CAGGCGTTCG GGGCCGTCCT
3051 CGTAGACGCC CATCGGCGCA TCACCGACCC GGAGAGGGAC ATCCAGGACT
3101 TTGTCCTCAC CGCCGAACTG AGCAGACACC CGCGCGCGTA CACCAACAAG
3151 CGCCTGGCCC ACCTGACGGT GTATTACAAG CTCATGGCCC GCCGCGCGCA
3201 GGTCCCGTCC ATCAAGGACC GGATCCCGTA CGTGATCGTG GCCCAGACCC
3251 GCGAGGTAGA GGAGACGGTC GCGCGGCTGG CCGCCCTCCG CGAGCTAGAC
3301 GCCGCCGCCC CAGGGGACGA GCCCGCCCCC CCAGCGGCCC TGCCCTCCCC
3351 GGCCAAGCGC CCCCGGGAGA CGCCGTCGCA TGCCGACCCC CCGGGAGGCG
3401 CGTCCAAGCC CCGCAAGCTG CTGGTGTCCG AGCTGGCGGA GGATCCCGGG
3451 TACGCCATCG CCCGGGGCGT TCCGCTCAAC ACGGACTATT ACTTCTCGCA
3501 CCTGCTGGGG GCGGCCTGCG TGACGTTCAA GGCCCTGTTT GGAAATAACG
3551 CCAAGATCAC CGAGAGTCTG TTAAAGAGGT TTATTCCCGA GACGTGGCAC
3601 CCCCCGGACG ACGTGGCCGC GCGGCTCAGG GCCGCGGGGT TCGGGCCGGC
3651 GGGGGCCGGC GCTACGGCGG AGGAAACTCG TCGAATGTTG CATAGAGCCT
3701 TTGATACTCT AGCATGA

SEQ.ID.NO. 2 Amino acid sequence of DNA polymerase for HSV2-MS-M1

1 MFCAAGGPTS PGGKSAARAA SGFFAPHNPR GATQTAPPPC RRQNFYNPHL
51 AQTGTQPKAP GPAQRHTYYS ECDEFRFIAP RSLDEDAPAE QRTGVHDGRL
101 RRAPKVYCGG DERDVLRVGP EGFWPRRLRL WGGADHAPKG FDPTVTVFHV
151 YDILEHVEHA YSMRAAQLHE RFMDAITPAG TVITLLGLTP EGHRVAVHVY
201 GTRQYFYMNK AEVDRHLQCR APRDLCERLA AALRESPGAS FRGISADHFE
251 AEVVERADVY YYETRPTLYY RVFVRSGRAL AYLCDNFCPA IRKYEGGVDA
301 TTRFILDNPG FVTFGWYRLK PGRGNAPAQP RPPTAFGTSS DVEFNCTADN
351 LAVEGAMCDL PAYKLMCFDI ECKAGGEDEL AFPVAERPED LVIQISCLLY
401 DLSTTALEHI LLFSLGSCDL PESHLSDLAS RGLPAPVVLE FDSEFEMLLA
451 FMTFVKQYGP EFVTGYNIIN FDWPFVLTKL TEIYKVPLDG YGRMNGRGVF
501 RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKVKLSSYK LNAVAEAVLK
551 DKKKDLSYRD IPAYYASGPA QRGVIGEYCV QDSLLVGQLF FKFLPHLELS
601 AVARLAGINI TRTIYDGQQI RVFTCLLRLA GQKGFILPDT QGRFRGLDKE
651 APKRPAVPRG EGERPGDGNG DEDKDDDEDE DGDEREEVAR ETGGRHVGYQ
701 GARVLDPTSG FHVDPVVVFD FASLYPSIIQ AHNLCFSTLS LRPEAVAHLE
751 ADRDYLEIEV GGRRLFFVKA HVRESLLSIL LRDWLAMRKQ IRSRIPQSTP
801 EEAVLLDKQQ AAIKVVCNSV YGFTGAQHGL LPCLHVAATV TTIGREMLLA
851 TRAYVHARWA EFDQLLADFP EAAGMRAPGP YSMRIIYGDT DSIFVLCRGL
901 TAAGLVAMGD KMASHISRAL FLPPIKLECE KTFTKLLLIA KKKYIGVICG
951 GKMLIKGVDL VRKNNCAFIN RTSRALVDLL FYDDTVSGAA AALAERPAEE
1001 WLARPLPEGL QAFGAVLVDA HRRITDPERD IQDFVLTAEL SRHPRAYTNK
1051 RLAHLTVYYK LMARRAQVPS IKDRIPYVIV AQTREVEETV ARLAALRELD
1101 AAAPGDEPAP PAALPSPAKR PRETPSHADP PGGASKPRKL LVSELAEDPG
1151 YAIARGVPLN TDYYFSHLLG AACVTFKALF GNNAKITESL LKRFIPETWH
1201 PPDDVAARLR AAGFGPAGAG ATAEETRRML HRAFDTLA*

SEQ.ID.NO. 3 DNA sequence of DNA polymerase gene for HSV2-186-M1

1 ATGTTTTGTG CCGCGGGCGG CCCGGCTTCC CCCGGGGGGA AGTCGGCGGC
51 TCGGGCGGCG TCTGGGTTTT TTGCCCCCCA CAACCCCCGG GGAGCCACCC
101 AGACGGCACC GCCGCCTTGC CGCCGGCAGA ACTTCTACAA CCCCCACCTC
151 GCTCAGACCG GAACGCAGCC AAAGGCCCCC GGGCCGGCTC AGCGCCATAC
201 GTACTACAGC GAGTGCGACG AATTTCGATT TATCGCCCCG CGTTCGCTGG
251 ACGAGGACGC CCCCGCGGAG CAGCGCACCG GGGTCCACGA CGGCCGCCTC
301 CGGCGCGCCC CTAAGGTGTA CTGCGGGGGG GACGAGCGCG ACGTCCTCCG
351 CGTGGGCCCG GAGGGCTTCT GGCCGCGTCG CTTGCGCCTG TGGGGCGGTG
401 CGGACCATGC CCCCGAGGGG TTCGACCCCA CCGTCACCGT CTTCCACGTG
451 TACGACATCC TGGAGCACGT GGAACACGCG TACAGCATGC GCGCCGCCCA
501 GCTCCACGAG CGATTTATGG ACGCCATCAC GCCCGCCGGG ACCGTCATCA
551 CGCTTCTGGG TCTGACCCCC GAAGGCCATC GCGTCGCCGT TCACGTCTAC
601 GGCACGCGGC AGTACTTTTA CATGAACAAG GCGGAGGTGG ATCGGCACCT
651 GCAGTGCCGT GCCCCGCGCG ATCTCTGCGA GCGCCTGGCG GCGGCCCTGC
701 GCGAGTCGCC GGGGGCGTCG TTCCGCGGCA TCTCCGCGGA CCACTTCGAG
751 GCGGAGGTGG TGGAGCGCGC CGACGTGTAC TATTACGAAA CGCGCCCGAC
801 CCTGTACTAC CGCGTCTTCG TGCGAAGCGG GCGCGCGCTG GCCTACCTGT
851 GCGACAACTT TTGCCCCGCG ATCAGGAAGT ACGAGGGGGG CGTCGACGCC
901 ACCACCCGGT TTATCCTGGA CAACCCGGGG TTTGTCACCT TCGGCTGGTA
951 CCGCCTCAAG CCCGGCCGCG GGAACGCGCC GGCCCAACCG CGCCCCCCGA
1001 CGGCGTTCGG AACCTCGAGC GACGTCGAGT TTAACTGCAC GGCGGACAAC
1051 CTGGCCGTCG AGGGGGCCAT GTGTGACCTG CCGGCCTACA AGCTCATGTG
1101 CTTCGATATC GAATGCAAGG CCGGGGGGGA GGACGAGCTG GCCTTTCCGG
1151 TCGCGGAACG CCCGGAAGAC CTCGTCATCC AGATCTCCTG TCTGCTCTAC
1201 GACCTGTCCA CCACCGCCCT CGAGCACATC CTCCTGTTTT CGCTCGGATC
1251 CTGCGACCTC CCCGAGTCCC ACCTCAGCGA TCTCGCCTCC AGGGGCCTGC
1301 CGGCCCCCGT CGTCCTGGAG TTTGACAGCG AATTCGAGAT GCTGCTGGCC
1351 TTCATGACCT TCGTCAAGCA GTACGGCCCC GAGTTCGTGA CCGGGTACAA
1401 CATCATCAAC TTCGACTGGC CCTTCGTCCT GACCAAGCTG ACGGAGATCT
1451 ACAAGGTCCC GCTCGACGGG TACGGGCGCA TGAACGGCCG GGGTGTGTTC
1501 CGCGTGTGGG ACATCGGCCA GAGCCACTTT CAGAAGCGCA GCAAGATCAA
1551 GGTGAACGGG ATGGTGAACA TCGACATGTA CGGCATCATC ACCGACAAGG
1601 TCAAACTCTC CAGCTACAAG CTGAACGCCG TCGCCGAGGC CGTCTTGAAG
1651 GACAAGAAGA AGGATCTGAG CTACCGCGAC ATCCCCGCCT ACTACGCCTC
1701 CGGGCCCGCG CAGCGCGGGG TGATCGGCGA GTATTGTGTG CAGGACTCGC
1751 TGCTGGTCGG GCAGCTGTTC TTCAAGTTTC TGCCGCACCT GGAGCTTTCC
1801 GCCGTCGCGC GCCTGGCGGG CATCAACATC ACCCGCACCA TCTACGACGG
1851 CCAGCAGATC CGCGTCTTCA CGTGCCTCCT GCGCCTTGCG GGCCAGAAGG
1901 GCTTCATCCT GCCGGACACC CAGGGGCGGT TTCGGGGCCT CGACAAGGAG
1951 GCGCCCAAGC GCCCGGCCGT GCCTGGGGGG GAAGGGGAGC GGCCGGGGGA
2001 CGGGAACGGG GACGAGGATA AGGACGACGA CGAGGACGGG GACGAGGACG
2051 GGGACGAGCG CGAGGAGGTC GCGCGCGAGA CCGGGGGCCG GCACGTTGGG
2101 TACCAGGGGG CCCGGGTCCT CGACCCCACC TCCGGGTTTC ACGTCGACCC
2151 CGTGGTGGTG TTTGACTTTG CCAGCCTGTA CCCCAGCATC ATCCAGGCCC
2201 ACAACCTGTG CTTCAGTACG CTCTCCCTGC GGCCCGAGGC CGTCGCGCAC
2251 CTGGAGGCGG ACCGGGACTA CCTGGAGATC GAGGTGGGGG GCCGACGGCT
2301 GTTCTTCGTG AAGGCCCACG TACGCGAGAG CCTGCTGAGC ATCCTGCTGC
2351 GCGACTGGCT GGCCATGCGA AAGCAGATCC GCTCGCGGAT CCCCCAGAGC
2401 CCCCCCGAGG AGGCCGTCCT CCTCGACAAG CAACAGGCCG CCATCAAGGT
2451 GGTGTGCAAC TCGGTGTACG GGTTCACCGG GGCGCAGCAC GGTCTTCTGC
2501 CCTGCCTGCA CGTGGCCGCC ACCGTGACGA CCATCGGCCG CGAGATGCTC
2551 CTCGCGACGC GCGCGTACGT GCACGCGCGC TGGGCGGAGT TCGATCAGCT
2601 GCTGGCCGAC TTTCCGGAGG CGGCCGGCAT GCGCGCCCCC GGTCCGTACT
2651 CCATGCGCAT CATCTACGGG GACACGGACT CCATTTTCGT TTTGTGCCGC
2701 GGCCTCACGG CCGCGGGCCT GGTGGCCATG GGCGACAAGA TGGCGAGCCA
2751 CATCTCGCGC GCGCTGTTCC TCCCCCCGAT CAAGCTCGAG TGCGAAAAAA
2801 CGTTCACCAA GCTGCTGCTC ATCGCCAAGA AAAAGTACAT CGGCGTCATC
2851 TGCGGGGGCA AGATGCTCAT CAAGGGCGTG GATCTGGTGC GCAAAAACAA
2901 CTGCGCGTTT ATCAACCGCA CCTCCAGGGC CCTGGTCGAC CTGCTGTTTT
2951 ACGACGATAC CGTATCCGGA GCGGCCGCCG CGTTAGCCGA GCGCCCCGCA
3001 GAGGAGTGGC TGGCGCGACC CCTGCCCGAG GGACTGCAGG CGTTCGGGGC
3051 CGTCCTCGTA GACGCCCATC GGCGCATCAC CGACCCGGAG AGGGACATCC
3101 AGGACTTTGT CCTCACCGCC GAACTGAGCA GACACCCGCG CGCGTACACC
3151 AACAAGCGCC TGGCCCACCT GACGGTGTAT TACAAGCTCA TGGCCCGCCG
3201 CGCGCAGGTC CCGTCCATCA AGGACCGGAT CCCGTACGTG ATCGTGGCCC
3251 AGACCCGCGA GGTAGAGGAG ACGGTCGCGC GGCTGGCCGC CCTCCGCGAG
3301 CTAGACGCCG CCGCCCCAGG GGACGAGCCC GCCCCCCCAG CGGCCCTGCC
3351 CTCCCCGGCC AAGCGCCCCC GGGAGACGCC GTCGCATGCC GACCCCCCGG
3401 GAGGCGCGTC CAAGCCCCGC AAGCTGCTGG TGTCCGAGCT GGCGGAGGAT
3451 CCCGGGTACG CCATCGCCCG GGGCGTTCCG CTCAACACGG ACTATTACTT
3501 CTCGCACCTG CTGGGGGCGG CCTGCGTGAC GTTCAAGGCC CTGTTTGGAA
3551 ATAACGCCAA GATCACCGAG AGTCTGTTAA AGAGGTTTAT TCCCGAGACG
3601 TGGCACCCCC CGGACGACGT GGCCGCGCGG CTCAGGGCCG CGGGGTTCGG
3651 GCCGGCGGGG GCCGGCGCTA CGGCGGAGGA AACTCGTCGA ATGTTGCATA
3701 GAGCCTTTGA TACTCTAGCA TGA






SEQ.ID.NO. 4 Amino acid sequence of DNA polymerase for HSV2-186-M1

1 MFCAAGGPAS PGGKSAARAA SGFFAPHNPR GATQTAPPPC RRQNFYNPHL
51 AQTGTQPKAP GPAQRHTYYS ECDEFRFIAP RSLDEDAPAE QRTGVHDGRL
101 RRAPKVYCGG DERDVLRVGP EGFWPRRLRL WGGADHAPEG FDPTVTVFHV
151 YDILEHVEHA YSMRAAQLHE RFMDAITPAG TVITLLGLTP EGHRVAVHVY
201 GTRQYFYMNK AEVDRHLQCR APRDLCERLA AALRESPGAS FRGISADHFE
251 AEVVERADVY YYETRPTLYY RVFVRSGRAL AYLCDNFCPA IRKYEGGVDA
301 TTRFILDNPG FVTFGWYRLK PGRGNAPAQP RPPTAFGTSS DVEFNCTADN
351 LAVEGAMCDL PAYKLMCFDI ECKAGGEDEL AFPVAERPED LVIQISCLLY
401 DLSTTALEHI LLFSLGSCDL PESHLSDLAS RGLPAPVVLE FDSEFEMLLA
451 FMTFVKQYGP EFVTGYNIIN FDWPFVLTKL TEIYKVPLDG YGRMNGRGVF
501 RVWDIGQSHF QKRSKIKVNG MVNIDMYGII TDKVKLSSYK LNAVAEAVLK
551 DKKKDLSYRD IPAYYASGPA QRGVIGEYCV QDSLLVGQLF FKFLPHLELS
601 AVARLAGINI TRTIYDGQQI RVFTCLLRLA GQKGFILPDT QGRFRGLDKE
651 APKRPAVPRG EGERPGDGNG DEDKDDDEDG DEDGDEREEV ARETGGRHVG
701 YQGARVLDPT SGFHVDPVVV FDFASLYPSI IQAHNLCFST LSLRPEAVAH
751 LEADRDYLEI EVGGRRLFFV KAHVRESLLS ILLRDWLAMR KQIRSRIPQS
801 PPEEAVLLDK QQAAIKVVCN SVYGFTGAQH GLLPCLHVAA TVTTIGREML
851 LATRAYVHAR WAEFDQLLAD FPEAAGMRAP GPYSMRIIYG DTDSIFVLCR
901 GLTAAGLVAM GDKMASHISR ALFLPPIKLE CEKTFTKLLL IAKKKYIGVI
951 CGGKMLIKGV DLVRKNNCAF INRTSRALVD LLFYDDTVSG AAAALAERPA
1001 EEWLARPLPE GLQAFGAVLV DAHRRITDPE RDIQDFVLTA ELSRHPRAYT
1051 NKRLAHLTVY YKLMARRAQV PSIKDRIPYV IVAQTREVEE TVARLAALRE
1101 LDAAAPGDEP APPAALPSPA KRPRETPSHA DPPGGASKPR KLLVSELAED
1151 PGYAIARGVP LNTDYYFSHL LGAACVTFKA LFGNNAKITE SLLKRFIPET
1201 WHPPDDVAAR LRAAGFGPAG AGATAEETRR MLHRAFDTLA*
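
Each amino acid sequence in this list corresponds to the DNA sequence listed immediately before it. As a check on that correspondence, the sketch below translates a DNA fragment with the standard genetic code. It is written in Python and is not part of the original disclosure. The function name `translate` and the use of "X" for an unrecognized codon are assumptions made for illustration. Applied to the first 30 nucleotides of SEQ.ID.NO. 3, it yields the first ten residues of SEQ.ID.NO. 4.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third base varying fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in reading frame 1.

    Whitespace (such as the ten-base grouping used in the listing) is
    ignored, trailing bases that do not fill a codon are dropped, and any
    codon containing a character other than A, C, G or T becomes "X".
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nucleotides of SEQ.ID.NO. 3, as grouped in the listing.
print(translate("ATGTTTTGTG CCGCGGGCGG CCCGGCTTCC"))  # MFCAAGGPAS
```

The same function can be run over a whole listed sequence to confirm that a DNA row and the corresponding amino acid row agree after transcription.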
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SEQ.ID.NO- 5 DNA sequence of DNA polymerase gene for HSV1-KOS-M1 

1 ATGTTTTCCG GTGGCGGCGG CCCGCTGTCC CCCGGAGGAA AGTCGGCGGC
51 CAGGGCGGCG TCCGGGTTTT TTGCGCCCGC CGGCCCTCGC GGAGCCGGCC
101 GGGGACCCCC GCCTTGTTTG AGGCAAAACT TTTACAACCC CTACCTCGCC
151 CCAGTCGGGA CGCAACAGAA GCCGACCGGG CCAACCCAGC GCCATACGTA
201 CTATAGCGAA TGCGATGAAT TTCGATTCAT CGCCCCGCGG GTGCTGGACG
251 AGGATGCCCC CCCGGAGAAG CGCGCCGGGG TGCACGACGG TCACCTCAAG
301 CGCGCCCCCA AGGTGTACTG CGGGGGGGAC GAGCGCGACG TCCTCCGCGT
351 CGGGTCGGGC GGCTTCTGGC CGCGGCGCTC GCGCCTGTGG GGCGGCGTGG
401 ACCACGCCCC GGCGGGGTTC AACCCCACCG TCACCGTCTT TCACGTGTAC
451 GACATCCTGG AGAACGTGGA GCACGCGTAC GGCATGCGCG CGGCCCAGTT
501 CCACGCGCGG TTTATGGACG CCATCACACC GACGGGGACC GTCATCACGC
551 TCCTGGGCCT GACTCCGGAA GGCCACCGGG TGGCCGTTCA CGTTTACGGC
601 ACGCGGCAGT ACTTTTACAT GAACAAGGAG GAGGTTGACA GGCACCTACA
651 ATGCCGCGCC CCACGAGATC TCTGCGAGCG CATGGCCGCG GCCCTGCGCG
701 AGTCCCCGGG CGCGTCGTTC CGCGGCATCT CCGCGGACCA CTTCGAGGCG
751 GAGGTGGTGG AGCGCACCGA CGTGTACTAC TACGAGACGC GCCCCGCTCT
801 GTTTTACCGC GTCTACGTCC GAAGCGGGCG CGTGCTGTCG TACCTGTGCG
851 ACAACTTCTG CCCGGCCATC AAGAAGTACG AGGGTGGGGT CGACGCCACC
901 ACCCGGTTCA TCCTGGACAA CCCCGGGTTC GTCACCTTCG GCTGGTACCG
951 TCTCAAACCG GGCCGGAACA ACACGCTAGC CCAGCCGCGG GCCCCGATGG
1001 CCTTCGGGAC ATCCAGCGAC GTCGAGTTTA ACTGTACGGC GGACAACCTG
1051 GCCATCGAGG GGGGCATGAG CGACCTACCG GCATACAAGC TCATGTGCTT
1101 CGATATCGAA TGCAAGGCGG GGGGGGAGGA CGAGCTGGCC TTTCCGGTGG
1151 CCGGGCACCC GGAGGACCTG GTTATTCAGA TATCCTGTCT GCTCTACGAC
1201 CTGTCCACCA CCGCCCTGGA GCACGTCCTC CTGTTTTCGC TCGGTTCCTG
1251 CGACCTCCCC GAATCCCACC TGAACGAGCT GGCGGCCAGG GGCCTGCCCA
1301 CGCCCGTGGT TCTGGAATTC GACAGCGAAT TCGAGATGCT GTTGGCCTTC
1351 ATGACCCTTG TGAAACAGTA CGGCCCCGAG TTCGTGACCG GGTACAACAT
1401 CATCAACTTC GACTGGCCCT TCTTGCTGGC CAAGTTGACG GACATTTACA
1451 AGGTCCCCCT GGACGGGTAC GGCCGCATGA ACGGCCGGGG CGTGTTTCGC
1501 GTGTGGGACA TAGGCCAGAG CCACTTCCAG AAGCGCAGCA AGATAAAGGT
1551 GAACGGCATG GTGAACATCG ACATGTACGG GATCATAACC GACAAGATCA
1601 AGCTCTCGAG CTACAAGCTC AACGCCGTGG CCGAAGCCGT CCTGAAGGAC
1651 AAGAAGAAGG ACCTGAGCTA TCGCGACATC CCCGCCTACT ACGCCGCCGG
1701 GCCCGCGCAA CGCGGGGTGA TCGGCGAGTA CTGCATACAG GATTCCCTGC
1751 TGGTGGGCCA GCTGTTTTTT AAGTTTTTGC CCCATCTGGA GCTCTCGGCC
1801 GTCGCGCGCT TGGCGGGTAT TAACATCACC CGCACCATCT ACGACGGCCA
1851 GCAGATCCGC GTCTTTACGT GCCTGCTGCG CCTGGCCGAC CAGAAGGGCT
1901 TTATTCTGCC GGACACCCAG GGGCGATTTA GGGGCGCCGG GGGGGAGGCG
1951 CCCAAGCGTC CGGCCGCAGC CCGGGAGGAC GAGGAGCGGC CAGAGGAGGA
2001 GGGGGAGGAC GAGGACGAAC GCGAGGAGGG CGGGGGCGAG CGGGAGCCGG
2051 AGGGCGCGCG GGAGACCGCC GGCCGGCACG TGGGGTACCA GGGGGCCAGG
2101 GTCCTTGACC CCACTTCCGG GTTTCACGTG AACCCCGTGG TGGTGTTCGA
2151 CTTTGCCAGC CTGTACCCCA GCATCATCCA GGCCCACAAC CTGTGCTTCA
2201 GCACGCTCTC CCTGAGGGCC GACGCAGTGG CGCACCTGGA GGCGGGCAAG
2251 GACTACCTGG AGATCGAGGT GGGGGGGCGA CGGCTGTTCT TCGTCAAGGC
2301 TCACGTGCGA GAGAGCCTCC TCAGCATCCT CCTGCGGGAC TGGCTCGCCA
2351 TGCGAAAGCA GATCCGCTCG CGGATTCCCC AGAGCAGCCC CGAGGAGGCC
2401 GTGCTCCTGG ACAAGCAGCA GGCCGCCATC AAGGTCGTGT GTAACTCGGT
2451 GTACGGGTTC ACGGGAGCGC AGCACGGACT CCTGCCGTGC CTGCACGTTG
2501 CCGCGACGGT GACGACCATC GGCCGCGAGA TGCTGCTCGC GACCCGCGAG
2551 TACGTCCACG CGCGCTGGGC GGCCTTCGAA CAGCTCCTGG CCGATTTCCC
2601 GGAGGCGGCC GACATGCGCG CCCCCGGGCC CTATTCCATG CGCATCATCT
2651 ACGGGGACAC GGACTCCATA TTTGTGCTGT GCCGCGGCCT CACGGCCGCC
2701 GGGCTGACGG CCATGGGCGA CAAGATGGCG AGCCACATCT CGCGCGCGCT
2751 GTTTCTGCCC CCCATCAAAC TCGAGTGCGA AAAGACGTTC ACCAAGCTGC
2801 TGCTGATCGC CAAGAAAAAG TACATCGGCG TCATCTACGG GGGTAAGATG
2851 CTCATCAAGG GCGTGGATCT GGTGCGCAAA AACAACTGCG CGTTTATCAA
2901 CCGCACCTCC AGGGCCCTGG TCGACCTGCT GTTTTACGAC GATACCGTAT
2951 CCGGAGCGGC CGCCGCGTTA GCCGAGCGCC CCGCAGAGGA GTGGCTGGCG
3001 CGACCCCTGC CCGAGGGACT GCAGGCGTTC GGGGCCGTCC TCGTAGACGC
3051 CCATCGGCGC ATCACCGACC CGGAGAGGGA CATCCAGGAC TTTGTCCTCA
3101 CCGCCGAACT GAGCAGACAC CCGCGCGCGT ACACCAACAA GCGCCTGGCC
3151 CACCTGACGG TGTATTACAA GCTCATGGCC CGCCGCGCGC AGGTCCCGTC
3201 CATCAAGGAC CGGATCCCGT ACGTGATCGT GGCCCAGACC CGCGAGGTAG
3251 AGGAGACGGT CGCGCGGCTG GCCGCCCTCC GCGAGCTAGA CGCCGCCGCC
3301 CCAGGGGACG AGCCCGCCCC CCCCGCGGCC CTGCCCTCCC CGGCCAAGCG
3351 CCCCCGGGAG ACGCCGTCGC ATGCCGACCC CCCGGGAGGC GCGTCCAAGC
3401 CCCGCAAGCT GCTGGTGTCC GAGCTGGCCG AGGATCCCGC ATACGCCATT
3451 GCCCACGGCG TCGCCCTGAA CACGGACTAT TACTTCTCCC ACCTGTTGGG
3501 GGCGGCGTGC GTGACATTCA AGGCCCTGTT TGGGAATAAC GCCAAGATCA
3551 CCGAGAGTCT GTTAAAAAGG TTTATTCCCG AAGTGTGGCA CCCCCCGGAC
3601 GACGTGGCCG CGCGGCTCCG GGCCGCAGGG TTCGGGGCGG TGGGTGCCGG
3651 CGCTACGGCG GAGGAAACTC GTCGAATGTT GCATAGAGCC TTTGATACTC
3701 TAGCATGA

SEQ.ID.NO. 6 Amino acid sequence of DNA polymerase for HSV1-KOS-M1
1 MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGRGPPPCL RQNFYNPYLA
51 PVGTQQKPTG PTQRHTYYSE CDEFRFIAPR VLDEDAPPEK RAGVHDGHLK
101 RAPKVYCGGD ERDVLRVGSG GFWPRRSRLW GGVDHAPAGF NPTVTVFHVY
151 DILENVEHAY GMRAAQFHAR FMDAITPTGT VITLLGLTPE GHRVAVHVYG
201 TRQYFYMNKE EVDRHLQCRA PRDLCERMAA ALRESPGASF RGISADHFEA
251 EVVERTDVYY YETRPALFYR VYVRSGRVLS YLCDNFCPAI KKYEGGVDAT
301 TRFILDNPGF VTFGWYRLKP GRNNTLAQPR APMAFGTSSD VEFNCTADNL
351 AIEGGMSDLP AYKLMCFDIE CKAGGEDELA FPVAGHPEDL VIQISCLLYD
401 LSTTALEHVL LFSLGSCDLP ESHLNELAAR GLPTPVVLEF DSEFEMLLAF
451 MTLVKQYGPE FVTGYNIINF DWPFLLAKLT DIYKVPLDGY GRMNGRGVFR
501 VWDIGQSHFQ KRSKIKVNGM VNIDMYGIIT DKIKLSSYKL NAVAEAVLKD
551 KKKDLSYRDI PAYYAAGPAQ RGVIGEYCIQ DSLLVGQLFF KFLPHLELSA
601 VARLAGINIT RTIYDGQQIR VFTCLLRLAD QKGFILPDTQ GRFRGAGGEA
651 PKRPAAARED EERPEEEGED EDEREEGGGE REPEGARETA GRHVGYQGAR
701 VLDPTSGFHV NPVVVFDFAS LYPSIIQAHN LCFSTLSLRA DAVAHLEAGK
751 DYLEIEVGGR RLFFVKAHVR ESLLSILLRD WLAMRKQIRS RIPQSSPEEA
801 VLLDKQQAAI KVVCNSVYGF TGAQHGLLPC LHVAATVTTI GREMLLATRE
851 YVHARWAAFE QLLADFPEAA DMRAPGPYSM RIIYGDTDSI FVLCRGLTAA
901 GLTAMGDKMA SHISRALFLP PIKLECEKTF TKLLLIAKKK YIGVIYGGKM
951 LIKGVDLVRK NNCAFINRTS RALVDLLFYD DTVSGAAAAL AERPAEEWLA
1001 RPLPEGLQAF GAVLVDAHRR ITDPERDIQD FVLTAELSRH PRAYTNKRLA
1051 HLTVYYKLMA RRAQVPSIKD RIPYVIVAQT REVEETVARL AALRELDAAA
1101 PGDEPAPPAA LPSPAKRPRE TPSHADPPGG ASKPRKLLVS ELAEDPAYAI
1151 AHGVALNTDY YFSHLLGAAC VTFKALFGNN AKITESLLKR FIPEVWHPPD
1201 DVAARLRAAG FGAVGAGATA EETRRMLHRA FDTLA*

SEQ.ID.NO. 7 DNA sequence of HSV polymerase gene for HSV1-F-M1



1 ATGTTTTCCG GTGGCGGCGG CCCGCTGTCC CCCGGAGGAA AGTCGGCGGC 

5 

51 CAGGGCGGCG TCCGGGTTTT TTGCGCCCGC CGGCCCTCGC GGAGCCGGCC 

101 GGGGACCCCC GCCTTGCTTG AGGCAAAACT TTTACAACCC CTACCTCGCC 

10 151 C C AGTCGGG A CGCAACAGAA GCCGACCGGG CCAACCCAGC GCCATACGTA 

2 01 CTATAGCGAA TGCGATGAAT TTCGATTCAT CGCCCCGCGG GTGCTGGACG 
251 AGGATGCCCC CCCGGAGAAG CGCGCCGGGG TGCACGACGG TCACCTCAAG 

15 

3 01 CGCGCCCCCA AGGTGTACTG CGGGGGGGAC GAGCGCGACG TCCTCCGCGT 
3 51 CGGGTCGGGC GGCTTCTGGC CGCGGCGCTC GCGCCTGTGG GGCGGCGTGG 

20 401 ACCACGCCCC GGCGGGGTTC AACCCCACCG TCACCGTCTT TCACGTGTAC 

451 GACATC CTGG AGAACGTGGA GCACGCGTAC GGCATGCGCG CGGCCCAGTT 

501 CCACGCGCGG TTTATGGACG CCATCACACC GACGGGGACC GTCATCACGC 

25 

551 TCCTGGGCCT GACTCCGGAA GGCCACCGGG TGGCCGTTCA CGTTTACGGC 

601 ACGCGGCAGT ACTTTTACAT GAACAAGGAG GAGGTCGACA GGCACCTACA 

30 651 ATGCCGCGCC CCACGAGATC TCTGCGAGCG CATGGCCGCG GCCCTGCGCG 

7 01 AGTCCCCGGG CGCGTCGTTC CGCGGCATTT CCGCGGACCA CTTCGAGGCG 

751 GAGGTGGTGG AGCGCACCGA CGTGTACTAC TACGAGACGC GCCCCGCTCT 

35 

801 GTTTTACCGC GTCTACGTCC GAAGCGGGCG CGTGCTGTCG TACCTGTGCG 

851 ACAACTTCTG CCCGGCCATC AAGAAGTACG AGGGTGGGGT CGACGCCACC 

40 9 01 ACCCGGTTCA TCCTGGACAA CCCCGGGTTC GTCACCTTCG GCTGGTACCG 

951 TCTCAAACCG GGCCGGAACA ACACGCTAGC CCAGCCGCGG GCCCCGATGG 

1001 CCTTCGGGAC ATCCAGCGAC GTCGAGTTTA ACTGTACGGC GGACAACCTG 

45 

1051 GCCATCGAGG GGGGCATGAG CGACCTACCG GCATACAAGC TCATGTGCTT 

1101 CGATATCGAA TGCAAGGCGG GGGGGGAGGA CGAGCTGGCC TTTCCGGTGG 

50 1151 CCGGGCACCC GGAGGACCTG GTCATCCAGA TATCCTGTCT GCTCTACGAC 

12 01 CTGTCCACCA CCGCCCTGGA GCACGTCCTC CTGTTTTCGC TCGGTTCCTG 
1251 CGACCTCCCC GAATCCCACC TGAACGAGCT GGCGGCCAGG GGCCTGCCCA 

55 

13 01 CGCCCGTGGT TCTGGAATTC GACAGCGAAT TCGAGATGCT GTTGGCCTTC 
1351 ATGACCCTTG TGAAACAGTA CGGCCCCGAG TTCGTGACCG GGTACAACAT 

60 1401 CATCAACTTC GACTGGCCCT TCTTGCTGGC CAAGCTGACG GACATTTACA 

1451 AGGTCCCCCT GGACGGGTAC GGCCGCATGA ACGGCCGGGG CGTGTTTCGC 
1501 GTGTGGGACA TAGGCCAGAG CCACTTCCAG AAGCGCAGCA AGATAAAGGT 
1551 GAACGGCATG GTGAACATCG ACATGTACGG GATTATAACC GACAAGATCA 
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65 



1601 


AGCTCTCGAG 


CTACAAGCTC 


AACGCCGTGG 


CCGAAGCCLrT 


C C TGAAGGAC 


1651 


AAGAAGAAGG 


ACCTGAGCTA 


TCGCGACATC 


CCCGL.L. 1 AL_T 


ALbLCGLLbb 


1701 


GCCCGCGCAA 


CGCGGGGTGA 


TCGGCGAGTA 


CTGL. A 1 At ALt 


LAi J.L.CC ILxL 


1751 


TGGTGGGCCA 


GCTGTTTTTT 


AAGTTTTTGC 


CCCA I L. 1 LjLtA 


LjL- i LTLbGLL 


1801 


GTCGCGCGCT 


TGGCGGGTAT 


T AAC ATC AC C 


CGCALLA JL L. 1 


ALbALbbLLA 


1851 


GCAGATCCGC 


GTCTTTACGT 


GCCTGCTGCG 


CCTGGCCGAC 


LAbAAGGbLT 


1901 


TTATTCTGCC 


GGACACCCAG 


GGGCGATTTA 


GGGGCGGCGG 


CaGGGGAGGCG 


1951 


CCCAAGCGTC 


CGGCCGCAGC 


CCGGGAGGAC 


G AGGAG LbbL 


LAbAGbAbbA 


2001 


GGGGGAGGAC 


GAGGACGAAC 


GCGAGGAGGG 


CGGGGGCGAG 


CGGGAGCCGG 


2051 


AGGGCGCGCG 


GGAGACCGCC 


GGCCGGCACG 


TGGGGTACCA 


GGGGGCCAGG 


2101 


GTCCTTGACC 


CCACTTCCGG 


GTTTCATGTG 


AACCCCGTGG 


TGGTGTTCGA 


2151 


CTTTGCCAGC 


CTGTACCCCA 


GCATCATCCA 


GGCCCACAAC 


CTGTGCTTCA 


2201 


GCACGCTCTC 


CCTGAGGGCC 


GACGCAGTGG 


CGCACC TGGA 


GGCGGGCAAG 


2251 


GACTACCTGG 


AGATCGAGGT 


GGGGGGGCGA 


CGGCTGTTCT 


TCGTCAAGGC 


2301 


TCACGTGCGA 


GAGAGCCTCC 


TCAGCATCCT 


CCTGCGGGAC 


TGGCTCGCCA 


2351 


TGCGAAAGCA 


GATCCGCTCG 


CGGATTCCCC 


AGAGCAGCCC 


CGAGGAGGCC 


2401 


GTGCTCCTGG 


ACAAGCAGCA 


GGCCGCCATC 


AAGGTCGTGT 


z-~t m tv tv nmnonm 

GTAACTCGGT 


2451 


TTACGGGTTC 


ACGGGAGCGC 


AGCACGGACT 


CCTGCCGTGC 


CTGCACGTTG 


2501 


CCGCGACGGT 


GACGACCATC 


GGCCGCGAGA 


TGCTGCTCGC 


GACCCGCGAG 


2551 


TACGTCCACG 


CGCGCTGGGC 


GGCCTTCGAA 


CAGCTCCTGG 


CCGATTTCCC 


2601 


GGAGGCGGCC 


GACATGCGCG 


CCCCCGGGCC 


CTATTCCATG 


CGCATCATCT 


2651 


ACGGGGACAC 


GGACTCCATC 


TTTGTGCTGT 


GCCGCGGCCT 


LALbbLLbLL 


2701 


GGGCTGACGG 


CCGTGGGCGA 


CAAGATGGCG 


AGCCACATC i 


CGL.GCGL.GL. I 


2751 


GTTTCTGTCC 


CCCATCAAAC 


TCGAGTGCGA 


AAALiACLj 11L 


7v a a nrrnpp 
AL.L-AALjL.TLiL. 


2801 


TGCTGATCGC 


CAAGAAAAAG 


TACATCGGCG 


TLA 1 LiALbb 


LjLjLi lAAbA X Lj 


2851 


CTCATCAAGG 


GCGTGGATCT 


GGTGCGCAAA 


AACAACTGGLr 


L.Lt1 1 1A1LAA 


2901 


CCGCACCTCC 


AGGGCCCTGG 


TCGACCTGCT 


GTTTTAC GAL 


0 A m A PPPrn7\ rp 

bA 1 AL. L.L; 1 A 1 


2951 


CCGGAGCGGC 


CGCCGCGTTA 


GCCGAGCGCC 


/""I /~1 /~1 TV /~t TV /^IZ-I TV 

C CGCAGAGG A 


GTGGL.TGGL.G 


3001 


CGACCCCTGC 


CCGAGGGACT 


GCAGGCGTTC 


GGGGCCGTCC 


T CGTAGAC GC 


3051 


CCATCGGCGC 


ATCACCGACC 


CGGAGAGGGA 


CATC C AGG AC 


TTTGTCCTCA 


3101 


CCGCCGAACT 


GAGCAGACAC 


CCGCGCGCGT 


AC AC C AAC AA 


GCGCCTGGCC 


3151 


CACCTGACGG 


TGTATTACAA 


GCTCATGGCC 


CGCCGCGCGC 


AGGTCCCGTC 


3201 


CATCAAGGAC 


CGGATCCCGT 


ACGTGATCGT 


GGCCCAGACC 


CGCGAGGTAG 
3251 AGGAGACGGT CGCGCGGCTG GCCGCCCTCC GCGAGCTCGA CGCCGCCGCC
3301 CCAGGGGACG AGCCCGCCCC CCCCGCGGCC CTGCCCTCCC CGGCCAAGCG
3351 CCCCCGGGAG ACGCCGTTGC ATGCCGACCC CCCGGGAGGC GCGTCCAAGC
3401 CCCGCAAGCT GCTGGTGTCC GAGCTGGCCG AGGATCCCGC ATACGCCATT
3451 GCCCACGGCG TCGCCCTGAA CACGGACTAT TACTTCTCCC ACCTGTTGGG
3501 GGCGGCGTGC GTGACATTCA AGGCCCTGTT TGGGAATAAC GCCAAGATCA
3551 CCGAGAGTCT GTTAAAAAGG TTTATTCCCG AAGTGTGGCA CCCCCCGGAC
3601 GACGTGGCCG CGCGGCTCCG GGCCGCAGGG TTCGGGGCGG TGGGTGCCGG
3651 CGCTACGGCG GAGGAAACTC GTCGAATGTT GCATAGAGCC TTTGATACTC
3701 TAGCATGA
SEQ.ID.NO. 8 Amino acid sequence of DNA polymerase for HSV1-F-M1

1 MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGRGPPPCL RQNFYNPYLA
51 PVGTQQKPTG PTQRHTYYSE CDEFRFIAPR VLDEDAPPEK RAGVHDGHLK
101 RAPKVYCGGD ERDVLRVGSG GFWPRRSRLW GGVDHAPAGF NPTVTVFHVY
151 DILENVEHAY GMRAAQFHAR FMDAITPTGT YITLLGLTPE GHRVAVHVYG
201 TRQYFYMNKE EVDRHLQCRA PRDLCERMAA ALRESPGASF RGISADHFEA
251 EVVERTDVYY YETRPALFYR VYVRSGRVLS YLCDNFCPAI KKYEGGVDAT
301 TRFILDNPGF VTFGWYRLKP GRNNTLAQPR APMAFGTSSD VEFNCTADNL
351 AIEGGMSDLP AYKLMCFDIE CKAGGEDELA FPVAGHPEDL VIQISCLLYD
401 LSTTALEHVL LFSLGSCDLP ESHLNELAAR GLPTPVVLEF DSEFEMLLAF
451 MTLVKQYGPE FVTGYNIINF DWPFLLAKLT DIYKVPLDGY GRMNGRGVFR
501 VWDIGQSHFQ KRSKIKVNGM VNIDMYGIIT DKIKLSSYKL NAVAEAVLKD
551 KKKDLSYRDI PAYYAAGPAQ RGVIGEYCIQ DSLLVGQLFF KFLPHLELSA
601 VARLAGINIT RTIYDGQQIR VFTCLLRLAD QKGFILPDTQ GRFRGGGGEA
651 PKRPAAARED EERPEEEGED EDEREEGGGE REPEGARETA GRHVGYQGAR
701 VLDPTSGFHV NPVVVFDFAS LYPSIIQAHN LCFSTLSLRA DAVAHLEAGK
751 DYLEIEVGGR RLFFVKAHVR ESLLSILLRD WLAMRKQIRS RIPQSSPEEA
801 VLLDKQQAAI KVVCNSVYGF TGAQHGLLPC LHVAATVTTI GREMLLATRE
851 YVHARWAAFE QLLADFPEAA DMRAPGPYSM RIIYGDTDSI FVLCRGLTAA
901 GLTAVGDKMA SHISRALFLS PIKLECEKTF TKLLLIAKKK YIGVIYGGKM
951 LIKGVDLVRK NNCAFINRTS RALVDLLFYD DTVSGAAAAL AERPAEEWLA
1001 RPLPEGLQAF GAVLVDAHRR ITDPERDIQD FVLTAELSRH PRAYTNKRLA
1051 HLTVYYKLMA RRAQVPSIKD RIPYVIVAQT REVEETVARL AALRELDAAA
1101 PGDEPAPPAA LPSPAKRPRE TPLHADPPGG ASKPRKLLVS ELAEDPAYAI
1151 AHGVALNTDY YFSHLLGAAC VTFKALFGNN AKITESLLKR FIPEVWHPPD
1201 DVAARLRAAG FGAVGAGATA EETRRMLHRA FDTLA*




SEQ.ID.NO. 9 DNA sequence of HSV polymerase gene for HSV1-DJL-M1

1 ATGTTTTCCG GTGGCGGCGG CCCGCTGTCC CCCGGAGGAA AGTCGGCGGC
51 CAGGGCGGCG TCCGGGTTTT TTGCGCCCGC CGGCCCTCGC GGAGCCGGCC
101 GGGGACCCCC GCCTTGTTTG AGGCAAAACT TTTACAACCC CTACCTCGCC
151 CCAGTCGGGA CGCAACAGAA GCCGACCGGG CCAACCCAGC GCCATACGTA
201 CTATAGCGAA TGCGATGAAT TTCGATTCAT CGCCCCGCGG GTGCTGGACG
251 AGGATGCCCC CCCGGAGAAG CGCGCCGGGG TGCACGACGG TCACCTCAAG
301 CGCGCCCCCA AGGTGTACTG CGGGGGGGAC GAGCGCGACG TCCTCCGCGT
351 CGGGTCGGGC GGCTTCTGGC CGCGGCGCTC GCGCCTGTGG GGCGGCGTGG
401 ACCACGCCCC GGCGGGGTTC AACCCCACCG TCACCGTCTT TCACGTGTAT
451 GACATCCTGG AGAACGTGGA GCACGCGTAC GGCATGCGCG CGGCCCAGTT
501 CCACGCGCGG TTTATGGACG CCATCACACC GACGGGGACC GTCATCACGC
551 TCCTGGGCCT GACTCCGGAA GGCCACCGGG TGGCCGTTCA CGTTTACGGC
601 ACGCGGCAGT ACTTTTACAT GAACAAGGAG GAGGTTGACA GGCACCTACA
651 ATGCCGCGCC CCACGAGATC TCTGCGAGCG CATGGCCGCG GCCCTGCGCG
701 AGTCCCCGGG CGCGTCGTTC CGCGGCATCT CCGCGGACCA CTTCGAGGCG
751 GAGGTGGTGG AGCGCACCGA CGTGTACTAC TACGAGACGC GCCCCGCTCT
801 GTTTTACCGC GTCTACGTCC GAAGCGGGCG CGTGCTGTCG TACCTGTGCG
851 ACAACTTCTG CCCGGCCATC AAGAAGTACG AGGGTGGGGT CGACGCCACC
901 ACCCGGTTCA TCCTGGACAA CCCCGGGTTC GTCACCTTCG GCTGGTACCG
951 TCTCAAACCG GGCCGGAACA ACACGCTAGC CCAGCCGCGG GCCCCGATGG
1001 CCTTCGGGAC ATCCAGCGAT GTCGAGTTTA ACTGTACGGC GGACAACCTG
1051 GCCATCGAGG GGGGCATGAG CGACCTACCG GCATACAAGC TCATGTGCTT
1101 CGATATCGAA TGCAAGGCGG GGGGGGAGGA CGAGCTGGCC TTTCCGGTGG
1151 CCGGGCACCC GGAGGACCTG GTCATCCAGA TATCCTGTCT GCTCTACGAC
1201 CTGTCCACCA CCGCCCTGGA GCACGTCCTC CTGTTTTCGC TCGGTTCCTG
1251 CGACCTCCCC GAATCCCACC TGAACGAGCT GGCGGCCAGG GGCCTGCCCA
1301 CGCCCGTGGT TCTGGAATTC GACAGCGAAT TCGAGATGCT GTTGGCCTTC
1351 ATGACCCTTG TGAAACAGTA CGGCCCCGAG TTCGTGACCG GGTACAACAT
1401 AATCAACTTC GACTGGCCCT TCTTGCTGGC CAAGCTGACG GACATTTACA
1451 AGGTCCCCCT GGACGGGTAC GGCCGCATGA ACGGCCGGGG CGTGTTTCGC
1501 GTGTGGGACA TAGGCCAGAG CCACTTCCAG AAGCGCAGCA AGATAAAGGT
1551 GAACGGCATG GTGAACATCG ACATGTACGG GATTATAACC GACAAGATCA
1601 AGCTCTCGAG CTACAAGCTC AACGCCGTGG CCGAAGCCGT CCTGAAGGAC
1651 AAGAAGAAGG ACCTGAGCTA TCGCGACATC CCCACCTACT ACGCCGCCGG
1701 GCCCGCGCAA CGCGGGGTGA TCGGCGAGTA CTGCATACAG GATTCCCTGC
1751 TGGTGGGCCA GCTGTTTTTT AAGTTTTTGC CCCATCTGGA GCTCTCGGCC
1801 GTCGCGCGCT TGGCGGGTAT TAACATCACC CGCACCATCT ACGACGGCCA
1851 GCAGATCCGC GTCTTTACGT GCCTGCTGCG CCTGGCCGAC CAGAAGGGCT
1901 TTATTCTGCC GGACACCCAG GGGCGATTTA GGGGCGCCGG GGGGGAGGCG
1951 CCCAAGCGTC CGGCCGCAGC CCGGGAGGAC GAGGAGCGGC CAGAGGAGGA
2001 GGGGGAGGAC GAGAACGAAC GCGAGGAGGG CGGGGGCGAG CGGGAGCCGG
2051 AGGGCGCGCG GGAGACCGCC GGCCGGCACG TGGGGTACCA GGGGGCCAGG
2101 GTCCTTGACC CCACTTCCGG GTTTCACGTG AACCCCGTGG TGGTGTTCGA
2151 CTTTGCCAGC CTGTACCCCA GCATCATCCA GGCCCACAAC CTGTGCTTCA
2201 GCACGCTCTC CCTGAGGGCC GACGCAGTGG CGCACCTGGA GGCGGGCAAG
2251 GACTACCTGG AGATCGAGGT GGGGGGGCGA CGGCTGTTCT TCGTCAAGGC
2301 TCACGTGCGA GAGAGCCTCC TCAGCATCCT CCTGCGGGAC TGGCTCGCCA
2351 TGCGAAAGCA GATCCGCTCG CGGATTCCCC AGAGCAGCCC CGAGGAGGCC
2401 GTGCTCCTGG ACAAGCAGCA GGCCGCCATC AAGGTCGTGT GTAACTCGGT
2451 TTACGGGTTC ACGGGAGCGC AGCACGGACT CCTGCCGTGC CTGCACGTTG
2501 CCGCGACGGT GACGACCATC GGCCGCGAGA TGCTGCTCGC GACCCGCGAG
2551 TACGTCCACG CGCGCTGGGC GGCCTTCGAA CAGCTCCTGG CCGATTTCCC
2601 GGAGGCGGCC GACATGCGCG CCCCCGGGCC CTATTCCATG CGCATCATCT
2651 ACGGGGACAC GGACTCCATA TTTGTGCTGT GCCGCGGCCT CACGGCCGCC
2701 GGGCTGACGG CCGTGGGCGA CAAGATGGCG AGCCACATCT CGCGCGCGCT
2751 GTTTCTGCCC CCCATCAAAC TCGAGTGCGA AAAGACGTTC ACCAAGCTGC
2801 TGCTGATCGC CAAGAAAAAG TACATCGGCG TCATCTACGG GGGTAAGATG
2851 CTCATCAAGG GCGTGGATCT GGTGCGCAAA AACAACTGCG CGTTTATCAA
2901 CCGCACCTCC AGGGCCCTGG TCGACCTGCT GTTTTACGAC GATACCGTAT
2951 CCGGAGCGGC CGCCGCGTTA GCCGAGCGCC CCGCAGAGGA GTGGCTGGCG
3001 CGACCCCTGC CCGAGGGACT GCAGGCGTTC GGGGCCGTCC TCGTAGACGC
3051 CCATCGGCGC ATCACCGACC CGGAGAGGGA CATCCAGGAC TTTGTTCTCA
3101 CCGCCGAACT GAGCAGACAC CCGCGCGCGT ACACCAACAA GCGCCTGGCC
3151 CACCTGACGG TGTATTACAA GCTCATGGCC CGCCGCGCGC AGGTCCCGTC
3201 CATCAAGGAC CGGATCCCGT ACGTGATCGT GGCCCAGACC CGCGAGGTAG
3251 AGGAGACGGT CGCGCGGCTG GCCGCCCTCC GCGAGCTAGA CGCCGCCGCC
3301 CCAGGGGACG AGCCCGCCCC CCCCGCGGCC CTGCCCTCCC CGGCCAAGCG
3351 CCCCCGGGAG ACGCCGTCGC CTGCCGACCC CCCGGGAGGC GCGTCCAAGC
3401 CCCGCAAGCT GCTGGTGTCC GAGCTGGCCG AGGATCCCGC ATACGCCATT
3451 GCCCACGGCG TCGCCCTGAA CACGGACTAT TACTTCTCCC ACCTGTTGGG
3501 GGCGGCGTGC GTGACATTCA AGGCCCTGTT TGGGAATAAC GCCAAGATCA
3551 CCGAGAGTCT GTTAAAAAGG TTTATTCCCG AAGTGTGGCA CCCCCCGGAC
3601 GACGTGGCCG CGCGGCTCCG GACCGCAGGG TTCGGGGCGG TGGGTGCCGG
3651 CGCTACGGCG GAGGAAACTC GTCGAATGTT GCATAGAGCC TTTGATACTC
3701 TAGCATGA
SEQ.ID.NO. 10 Amino acid sequence of DNA polymerase for HSV1-DJL-M1

1 MFSGGGGPLS PGGKSAARAA SGFFAPAGPR GAGRGPPPCL RQNFYNPYLA
51 PVGTQQKPTG PTQRHTYYSE CDEFRFIAPR VLDEDAPPEK RAGVHDGHLK
101 RAPKVYCGGD ERDVLRVGSG GFWPRRSRLW GGVDHAPAGF NPTVTVFHVY
151 DILENVEHAY GMRAAQFHAR FMDAITPTGT VITLLGLTPE GHRVAVHVYG
201 TRQYFYMNKE EVDRHLQCRA PRDLCERMAA ALRESPGASF RGISADHFEA
251 EVVERTDVYY YETRPALFYR VYVRSGRVLS YLCDNFCPAI KKYEGGVDAT
301 TRFILDNPGF VTFGWYRLKP GRNNTLAQPR APMAFGTSSD VEFNCTADNL
351 AIEGGMSDLP AYKLMCFDIE CKAGGEDELA FPVAGHPEDL VIQISCLLYD
401 LSTTALEHVL LFSLGSCDLP ESHLNELAAR GLPTPVVLEF DSEFEMLLAF
451 MTLVKQYGPE FVTGYNIINF DWPFLLAKLT DIYKVPLDGY GRMNGRGVFR
501 VWDIGQSHFQ KRSKIKVNGM VNIDMYGIIT DKIKLSSYKL NAVAEAVLKD
551 KKKDLSYRDI PTYYAAGPAQ RGVIGEYCIQ DSLLVGQLFF KFLPHLELSA
601 VARLAGINIT RTIYDGQQIR VFTCLLRLAD QKGFILPDTQ GRFRGAGGEA
651 PKRPAAARED EERPEEEGED ENEREEGGGE REPEGARETA GRHVGYQGAR
701 VLDPTSGFHV NPVVVFDFAS LYPSIIQAHN LCFSTLSLRA DAVAHLEAGK
751 DYLEIEVGGR RLFFVKAHVR ESLLSILLRD WLAMRKQIRS RIPQSSPEEA
801 VLLDKQQAAI KVVCNSVYGF TGAQHGLLPC LHVAATVTTI GREMLLATRE
851 YVHARWAAFE QLLADFPEAA DMRAPGPYSM RIIYGDTDSI FVLCRGLTAA
901 GLTAVGDKMA SHISRALFLP PIKLECEKTF TKLLLIAKKK YIGVIYGGKM
951 LIKGVDLVRK NNCAFINRTS RALVDLLFYD DTVSGAAAAL AERPAEEWLA
1001 RPLPEGLQAF GAVLVDAHRR ITDPERDIQD FVLTAELSRH PRAYTNKRLA
1051 HLTVYYKLMA RRAQVPSIKD RIPYVIVAQT REVEETVARL AALRELDAAA
1101 PGDEPAPPAA LPSPAKRPRE TPSPADPPGG ASKPRKLLVS ELAEDPAYAI
1151 AHGVALNTDY YFSHLLGAAC VTFKALFGNN AKITESLLKR FIPEVWHPPD
1201 DVAARLRTAG FGAVGAGATA EETRRMLHRA FDTLA*
SEQ.ID.NO. 11 DNA sequence of DNA polymerase gene for HCMV-AD169-M1

1 ATGTTTTTCA ACCCGTATCT GAGCGGCGGC GTGACCGGCG GTGCGGTCGC
51 GGGTGGCCGG CGTCAGCGTT CGCAGCCCGG CTCCGCGCAG GGCTCGGGCA
101 AGCGGCCGCC ACAGAAACAG TTTTTGCAGA TCGTGCCGCG AGGTGTCATG
151 TTCGACGGTC AGACGGGGTT GATCAAGCAT AAGACGGGAC GGCTGCCTCT
201 CATGTTCTAT CGAGAGATTA AACATTTGTT GAGTCATGAC ATGGTTTGGC
251 CGTGTCCTTG GCGCGAGACC CTGGTGGGTC GCGTGGTGGG ACCTATTCGT
301 TTTCACACCT ACGATCAGAC GGACGCCGTG CTCTTCTTCG ACTCGCCCGA
351 AAACGTGTCG CCGCGCTATC GTCAGCATCT GGTGCCTTCG GGGAACGTGT
401 TGCGTTTCTT CGGGGCCACA GAACACGGCT ACAGTATCTG CGTCAACGTT
451 TTCGGGCAGC GCAGCTACTT TTACTGTGAG TACAGCGACA CCGATAGGCT
501 GCGTGAGGTC ATTGCCAGCG TGGGCGAACT AGTGCCCGAA CCGCGGACGC
551 CATACGCCGT GTCTGTCACG CCGGCCACCA AGACCTCCAT CTATGGGTAC
601 GGGACGCGAC CCGTGCCCGA TTTGCAGTGT GTGTCTATCA GCAACTGGAC
651 CATGGCCAGA AAAATCGGCG AGTATCTGCT GGAGCAGGGT TTTCCCGTGT
701 ACGAGGTCCG TGTGGATCCG CTGACGCGTT TGGTCATCGA TCGGCGGATC
751 ACCACGTTCG GCTGGTGCTC CGTGAATCGT TACGACTGGC GGCAGCAGGG
801 TCGCGCGTCG ACTTGTGATA TCGAGGTAGA CTGCGATGTC TCTGACCTGG
851 TGGCTGTGCC CGACGACAGC TCGTGGCCGC GCTATCGATG CCTGTCCTTC
901 GATATCGAGT GCATGAGCGG CGAGGGTGGT TTTCCCTGCG CCGAGAAGTC
951 CGATGACATT GTCATTCAGA TCTCGTGCGT GTGCTACGAG ACGGGGGGAA
1001 ACACCGCCGT GGATCAGGGG ATCCCAAACG GGAACGATGG TCGGGGCTGC
1051 ACTTCGGAGG GTGTGATCTT TGGGCACTCG GGTCTTCATC TCTTTACGAT
1101 CGGCACCTGC GGGCAGGTGG GCCCAGACGT GGACGTCTAC GAGTTCCCTT
1151 CCGAATACGA GCTGCTGCTG GGCTTTATGC TTTTCTTTCA ACGGTACGCG
1201 CCGGCCTTTG TGACCGGTTA CAACATCAAC TCTTTTGACT TGAAGTACAT
1251 CCTCACGCGT CTCGAGTACC TGTATAAGGT GGACTCGCAG CGCTTCTGCA
1301 AGTTGCCTAC GGCGCAGGGC GGCCGTTTCT TTTTACACAG CCCCGCCGTG
1351 GGTTTTAAGC GGCAGTACGC CGCCGCTTTT CCCTCGGCTT CTCACAACAA
1401 TCCGGCCAGC ACGGCCGCCA CCAAGGTGTA TATTGCGGGT TCGGTGGTTA
1451 TCGACATGTA CCCTGTATGC ATGGCCAAGA CTAACTCGCC CAACTATAAG
1501 CTCAACACTA TGGCCGAGCT TTACCTGCGG CAACGCAAGG ATGACCTGTC
1551 TTACAAGGAC ATCCCGCGTT GTTTCGTGGC TAATGCCGAG GGCCGCGCCC
1601 AGGTAGGCCG TTACTGTCTG CAGGACGCCG TATTGGTGCG CGATCTGTTC
1651 AACACCATTA ATTTTCACTA CGAGGCCGGG GCCATCGCGC GGCTGGCTAA
1701 AATTCCGTTG CGGCGTGTCA TCTTTGACGG ACAGCAGATC CGTATCTACA
1751 CCTCGCTGCT GGACGAGTGC GCCTGCCGCG ATTTTATCCT GCCCAACCAC
1801 TACAGCAAAG GTACGACGGT GCCCGAAACG AATAGCGTTG CTGTGTCACC
1851 TAACGCTGCT ATCATCTCTA CCGCCGCTGT GCCCGGCGAC GCGGGTTCTG
1901 TGGCGGCTAT GTTTCAGATG TCGCCGCCCT TGCAATCTGC GCCGTCCAGT
1951 CAGGACGGCG TTTCACCCGG CTCCGGCAGT AACAGTAGTA GCAGCGTCGG
2001 CGTTTTCAGC GTCGGCTCCG GCAGTAGTGG CGGCGTCGGC GTTTCCAACG
2051 ACAATCACGG CGCCGGCGGT ACTGCGGCGG TTTCGTACCA GGGCGCCACG
2101 GTGTTTGAGC CCGAGGTGGG TTACTACAAC GACCCCGTGG CCGTGTTCGA
2151 GTTTGCCAGC CTCTACCCTT CCATCATCAT GGCCCACAAC CTCTGCTACT
2201 CCACCCTGCT GGTGCCGGGT GGCGAGTACC CTGTGGACCC CGCCGACGTA
2251 TACAGCGTCA CGCTAGAGAA CGGCGTGACC CACCGCTTTG TGCGTGCTTC
2301 GGTGCGCGTC TCGGTGCTCT CGGAACTGCT CAACAAGTGG GTTTCGCAGC
2351 GGCGTGCCGT GCGCGAATGC ATGCGCGAGT GTCAAGACCC TGTGCGCCGT
2401 ATGCTGCTCG ACAAGGAACA GATGGCGCTC AAAGTAACGT GCAACGCTTT
2451 CTACGGTTTT ACCGGCGCGC TGAACGGTAT GATGCCGTGT CTGCCCATCG
2501 CCGCCAGCAT CACGCGCATC GGTCGCGACA TGCTAGAGCG CACGGCGCGG
2551 TTCATCAAAG ACAACTTTTC AGAGCCGTGT TTTTTGCACA ATTTTTTTAA
2601 TCAGGAAGAC TATGTAGTGG GAACGCGGGA GGGGGATTCG GAGGAGAGCA
2651 GCGCGTTACC GGAGGGGCTC GAAACATCGT CAGGGGGCTC GAACGAACGG
2701 CGGGTGGAGG CGCGGGTCAT CTACGGGGAC ACGGACAGCG TGTTTGTCCG
2751 GTTTCGTGGC CTGACGCCGC AGGCTCTGGT GGCGCGTGGG CCCAGCCTGG
2801 CGCACTACGT GACGGCCTGT CTTTTTGTGG AGCCCGTCAA GCTGGAGTTT
2851 GAAAAGGTCT TCGTCTCTCT TATGATGATC TGCAAGAAAC GTTACATCGG
2901 CAAAGTGGAG GGCGCCTCGG GTCTGAGCAT GAAGGGCGTG GATCTGGTGC
2951 GCAAGACGGC CTGCGAGTTC GTCAAGGGCG TCACGCGTGA CGTCCTCTCG
3001 CTGCTCTTTG AGGATCGCGA GGTCTCGGAA GCAGCCGTGC GCCTGTCGCG
3051 CCTCTCACTC GATGAAGTCA AGAAGTACGG CGTGCCACGC GGTTTCTGGC
3101 GTATCTTACG CCGCTTGGTG CAGGCCCGCG ACGATCTGTA CCTGCACCGT
3151 GTGCGTGTCG AGGACCTGGT GCTTTCGTCG GTGCTCTCTA AGGACATCTC
3201 GCTGTACCGT CAATCTAACC TGCCGCACAT TGCCGTCATT AAGCGATTGG
3251 CGGCCCGTTC TGAGGAGCTA CCCTCGGTCG GGGATCGGGT CTTTTACGTT
3301 CTGACGGCGC CCGGTGTCCG GACGGCGCCG CAGGGTTCCT CCGACAACGG
3351 TGATTCTGTA ACCGCCGGCG TGGTTTCCCG GTCGGACGCG ATTGATGGCA
3401 CGGACGACGA CGCTGACGGC GGCGGGGTAG AGGAGAGCAA CAGGAGAGGA
3451 GGAGAGCCGG CAAAGAAGAG GGCGCGGAAA CCACCGTCGG CCGTGTGCAA
3501 CTACGAGGTA GCCGAAGATC CGAGCTACGT GCGCGAGCAC GGCGTGCCCA
3551 TTCACGCCGA CAAGTACTTT GAGCAGGTTC TCAAGGCTGT AACTAACGTG
3601 CTGTCGCCCG TCTTTCCCGG CGGCGAAACC GCGCGCAAGG ACAAGTTTTT
3651 GCACATGGTG CTGCCGCGGC GCTTGCACTT GGAGCCGGCT TTTCTGCCGT
3701 ACAGTGTCAA GGCGCACGAA TGCTGTTGA
SEQ.ID.NO. 12 Amino acid sequence of DNA polymerase for HCMV-AD169-M1

1 MFFNPYLSGG VTGGAVAGGR RQRSQPGSAQ GSGKRPPQKQ FLQIVPRGVM
51 FDGQTGLIKH KTGRLPLMFY REIKHLLSHD MVWPCPWRET LVGRVVGPIR
101 FHTYDQTDAV LFFDSPENVS PRYRQHLVPS GNVLRFFGAT EHGYSICVNV
151 FGQRSYFYCE YSDTDRLREV IASVGELVPE PRTPYAVSVT PATKTSIYGY
201 GTRPVPDLQC VSISNWTMAR KIGEYLLEQG FPVYEVRVDP LTRLVIDRRI
251 TTFGWCSVNR YDWRQQGRAS TCDIEVDCDV SDLVAVPDDS SWPRYRCLSF
301 DIECMSGEGG FPCAEKSDDI VIQISCVCYE TGGNTAVDQG IPNGNDGRGC
351 TSEGVIFGHS GLHLFTIGTC GQVGPDVDVY EFPSEYELLL GFMLFFQRYA
401 PAFVTGYNIN SFDLKYILTR LEYLYKVDSQ RFCKLPTAQG GRFFLHSPAV
451 GFKRQYAAAF PSASHNNPAS TAATKVYIAG SVVIDMYPVC MAKTNSPNYK
501 LNTMAELYLR QRKDDLSYKD IPRCFVANAE GRAQVGRYCL QDAVLVRDLF
551 NTINFHYEAG AIARLAKIPL RRVIFDGQQI RIYTSLLDEC ACRDFILPNH
601 YSKGTTVPET NSVAVSPNAA IISTAAVPGD AGSVAAMFQM SPPLQSAPSS
651 QDGVSPGSGS NSSSSVGVFS VGSGSSGGVG VSNDNHGAGG TAAVSYQGAT
701 VFEPEVGYYN DPVAVFDFAS LYPSIIMAHN LCYSTLLVPG GEYPVDPADV
751 YSVTLENGVT HRFVRASVRV SVLSELLNKW VSQRRAVREC MRECQDPVRR
801 MLLDKEQMAL KVTCNAFYGF TGALNGMMPC LPIAASITRI GRDMLERTAR
851 FIKDNFSEPC FLHNFFNQED YVVGTREGDS EESSALPEGL ETSSGGSNER
901 RVEARVIYGD TDSVFVRFRG LTPQALVARG PSLAHYVTAC LFVEPVKLEF
951 EKVFVSLMMI CKKRYIGKVE GASGLSMKGV DLVRKTACEF VKGVTRDVLS
1001 LLFEDREVSE AAVRLSRLSL DEVKKYGVPR GFWRILRRLV QARDDLYLHR
1051 VRVEDLVLSS VLSKDISLYR QSNLPHIAVI KRLAARSEEL PSVGDRVFYV
1101 LTAPGVRTAP QGSSDNGDSV TAGVVSRSDA IDGTDDDADG GGVEESNRRG
1151 GEPAKKRARK PPSAVCNYEV AEDPSYVREH GVPIHADKYF EQVLKAVTNV
1201 LSPVFPGGET ARKDKFLHMV LPRRLHLEPA FLPYSVKAHE CC*
Figure 6

SEQ.ID.NO. 13 Amino acid sequence of DNA polymerase for HCMV-AD169

1 MFFNPYLSGG VTGGAVAGGR RQRSQPGSAQ GSGKRPPQKQ FLQIVPRGVM
51 FDGQTGLIKH KTGRLPLMFY REIKHLLSHD MVWPCPWRET LVGRVVGPIR
101 FHTYDQTDAV LFFDSPENVS PRYRQHLVPS GNVLRFFGAT EHGYSICVNV
151 FGQRSYFYCE YSDTDRLREV IASVGELVPE PRTPYAVSVT PATKTSIYGY
201 GTRPVPDLQC VSISNWTMAR KIGEYLLEQG FPVYEVRVDP LTRLVIDRRI
251 TTFGWCSVNR YDWRQQGRAS TCDIEVDCDV SDLVAVPDDS SWPRYRCLSF
301 DIECMSGEGG FPCAEKSDDI VIQISCVCYE TGGNTAVDQG IPNGNDGRGC
351 TSEGVIFGHS GLHLFTIGTC GQVGPDVDVY EFPSEYELLL GFMLFFQRYA
401 PAFVTGYNIN SFDLKYILTR LEYLYKVDSQ RFCKLPTAQG GRFFLHSPAV
451 GFKRQYAAAF PSASHNNPAS TAATKVYIAG SVVIDMYPVC MAKTNSPNYK
501 LNTMAELYLR QRKDDLSYKD IPRCFVANAE GRAQVGRYCL QDAVLVRDLF
551 NTINFHYEAG AIARLAKIPL RRVIFDGQQI RIYTSLLDEC ACRDFILPNH
601 YSKGTTVPET NSVAVSPNAA IISTAAVPGD AGSVAAMFQM SPPLQSAPSS
651 QDGVSPGSGS NSSSSVGVFS VGSGSSGGVG VSNDNHGAGG TAAVSYQGAT
701 VFEPEVGYYN DPVAVFDFAS LYPSIIMAHN LCYSTLLVPG GEYPVDPADV
751 YSVTLENGVT HRFVRASVRV SVLSELLNKW VSQRRAVREC MRECQDPVRR
801 MLLDKEQMAL KVTCNAFYGF TGVVNGMMPC LPIAASITRI GRDMLERTAR
851 FIKDNFSEPC FLHNFFNQED YVVGTREGDS EESSALPEGL ETSSGGSNER
901 RVEARVIYGD TDSVFVRFRG LTPQALVARG PSLAHYVTAC LFVEPVKLEF
951 EKVFVSLMMI CKKRYIGKVE GASGLSMKGV DLVRKTACEF VKGVTRDVLS
1001 LLFEDREVSE AAVRLSRLSL DEVKKYGVPR GFWRILRRLV QARDDLYLHR
1051 VRVEDLVLSS VLSKDISLYR QSNLPHIAVI KRLAARSEEL PSVGDRVFYV
1101 LTAPGVRTAP QGSSDNGDSV TAGVVSRSDA IDGTDDDADG GGVEESNRRG
1151 GEPAKKRARK PPSAVCNYEV AEDPSYVREH GVPIHADKYF EQVLKAVTNV
1201 LSPVFPGGET ARKDKFLHMV LPRRLHLEPA FLPYSVKAHE CC*
SEQUENCE LISTING

<110> Homa, Fred 

Wathen, Michael 
Hopkins, Todd 
Thomsen, Darrell 

<120> A Method for Treating Herpes Virus

<130> 00221 

<160> 19 

<170> PatentIn version 3.0

<210> 1 

<211> 3717 

<212> DNA 

<213> herpes simplex 

<400> 1 



atgttttgtg ccgcgggcgg cccgacttcc cccgggggga agtcggcggc tcgggcggcg 60
tctgggtttt ttgcccccca caacccccgg ggagccaccc agacggcacc gccgccttgc 120
cgccggcaga acttctacaa cccccacctc gctcagaccg gaacgcagcc aaaggccccc 180
gggccggctc agcgccatac gtactacagc gagtgcgacg aatttcgatt tatcgccccg 240
cgttcgctgg acgaggacgc ccccgcggag cagcgcaccg gggtccacga cggccgcctc 300
cggcgcgccc ctaaggtgta ctgcgggggg gacgagcgcg acgtcctccg cgtgggcccg 360
gagggcttct ggccgcgtcg cttgcgcctg tggggcggtg cggaccatgc ccccaagggg 420
ttcgacccca ccgtcaccgt cttccacgtg tacgacatcc tggagcacgt ggaacacgcg 480
tacagcatgc gcgccgccca gctccacgag cgatttatgg acgccatcac gcccgccggg 540
accgtcatca cgcttctggg tctgaccccc gaaggccatc gcgtcgccgt tcacgtctac 600
ggcacgcggc agtactttta catgaacaag gcggaggtgg atcggcacct gcagtgccgt 660
gccccgcgcg atctctgcga gcgcctggcg gcggccctgc gcgagtcgcc gggggcgtcg 720
ttccgcggca tctccgcgga ccacttcgag gcggaggtgg tggagcgcgc cgacgtgtac 780
tattacgaaa cgcgcccgac cctgtactac cgcgtcttcg tgcgaagcgg gcgcgcgctg 840
gcctacctgt gcgacaactt ttgccccgcg atcaggaagt acgagggggg cgtcgacgcc 900
accacccggt ttatcctgga caacccgggg tttgtcacct tcggctggta ccgcctcaag 960
cccggccgcg ggaacgcgcc ggcccaaccg cgccccccga cggcgttcgg aacctcgagc 1020
gacgtcgagt ttaactgcac ggcggacaac ctggccgtcg agggggccat gtgtgacctg 1080
ccggcctaca agctcatgtg cttcgatatc gaatgcaagg ccggggggga ggacgagctg 1140
gcctttccgg tcgcggaacg cccggaagac ctcgtcatcc agatctcctg tctgctctac 1200
gacctgtcca ccaccgccct cgagcacatc ctcctgtttt cgctcggatc ctgcgacctc 1260
cccgagtccc acctcagcga tctcgcctcc aggggcctgc cggcccccgt cgtcctggag 1320
tttgacagcg aattcgagat gctgctggcc ttcatgacct tcgtcaagca gtacggcccc 1380
gagttcgtga ccgggtacaa catcatcaac ttcgactggc ccttcgtcct gaccaagctg 1440
acggagatct acaaggtccc gctcgacggg tacgggcgca tgaacggccg gggtgtgttc 1500
cgcgtgtggg acatcggcca gagccacttt cagaagcgca gcaagatcaa ggtgaacggg 1560
atggtgaaca tcgacatgta cggcatcatc accgacaagg tcaaactctc cagctacaag 1620
ctgaacgccg tcgccgaggc cgtcttgaag gacaagaaga aggatctgag ctaccgcgac 1680
atccccgcct actacgcctc cgggcccgcg cagcgcgggg tgatcggcga gtattgtgtg 1740
caggactcgc tgctggtcgg gcagctgttc ttcaagtttc tgccgcacct ggagctttcc 1800
gccgtcgcgc gcctggcggg catcaacatc acccgcacca tctacgacgg ccagcagatc 1860
cgcgtcttca cgtgcctcct gcgccttgcg ggccagaagg gcttcatcct gccggacacc 1920
caggggcggt ttcggggcct cgacaaggag gcgcccaagc gcccggccgt gcctcggggg 1980
gaaggggagc ggccggggga cgggaacggg gacgaggata aggacgacga cgaggacgag 2040
gacggggacg agcgcgagga ggtcgcgcgc gagaccgggg gccggcacgt tgggtaccag 2100
ggggcccggg tcctcgaccc cacctccggg tttcacgtcg accccgtggt ggtgtttgac 2160
tttgccagcc tgtaccccag catcatccag gcccacaacc tgtgcttcag tacgctctcc 2220
ctgcggcccg aggccgtcgc gcacctggag gcggaccggg actacctgga gatcgaggtg 2280
gggggccgac ggctgttctt cgtgaaggcc cacgtacgcg agagcctgct gagcatcctg 2340
ctgcgcgact ggctggccat gcgaaagcag atccgctcgc ggatccccca gagcaccccc 2400
gaggaggccg tcctcctcga caagcaacag gccgccatca aggtggtgtg caactcggtg 2460
tacgggttca ccggggcgca gcacggtctt ctgccctgcc tgcacgtggc cgccaccgtg 2520
acgaccatcg gccgcgagat gctcctcgcg acgcgcgcgt acgtgcacgc gcgctgggcg 2580
gagttcgatc agctgctggc cgactttccg gaggcggccg gcatgcgcgc ccccggtccg 2640
tactccatgc gcatcatcta cggggacacg gactccattt tcgttttgtg ccgcggcctc 2700
acggccgcgg gcctggtggc catgggcgac aagatggcga gccacatctc gcgcgcgctg 2760
ttcctccccc cgatcaagct cgagtgcgaa aaaacgttca ccaagctgct gctcatcgcc 2820
aagaaaaagt acatcggcgt catctgcggg ggcaagatgc tcatcaaggg cgtggatctg 2880
gtgcgcaaaa acaactgcgc gtttatcaac cgcacctcca gggccctggt cgacctgctg 2940
ttttacgacg ataccgtatc cggagcggcc gccgcgttag ccgagcgccc cgcagaggag 3000
tggctggcgc gacccctgcc cgagggactg caggcgttcg gggccgtcct cgtagacgcc 3060
catcggcgca tcaccgaccc ggagagggac atccaggact ttgtcctcac cgccgaactg 3120
agcagacacc cgcgcgcgta caccaacaag cgcctggccc acctgacggt gtattacaag 3180
ctcatggccc gccgcgcgca ggtcccgtcc atcaaggacc ggatcccgta cgtgatcgtg 3240
gcccagaccc gcgaggtaga ggagacggtc gcgcggctgg ccgccctccg cgagctagac 3300
gccgccgccc caggggacga gcccgccccc ccagcggccc tgccctcccc ggccaagcgc 3360
ccccgggaga cgccgtcgca tgccgacccc ccgggaggcg cgtccaagcc ccgcaagctg 3420
ctggtgtccg agctggcgga ggatcccggg tacgccatcg cccggggcgt tccgctcaac 3480
acggactatt acttctcgca cctgctgggg gcggcctgcg tgacgttcaa ggccctgttt 3540
ggaaataacg ccaagatcac cgagagtctg ttaaagaggt ttattcccga gacgtggcac 3600
cccccggacg acgtggccgc gcggctcagg gccgcggggt tcgggccggc gggggccggc 3660
gctacggcgg aggaaactcg tcgaatgttg catagagcct ttgatactct agcatga 3717



<210> 2 

<211> 1238 

<212> PRT 

<213> herpes simplex 

<400> 2 

Met Phe Cys Ala Ala Gly Gly Pro Thr Ser Pro Gly Gly Lys Ser Ala
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro His Asn Pro Arg Gly Ala
20 25 30

Thr Gln Thr Ala Pro Pro Pro Cys Arg Arg Gln Asn Phe Tyr Asn Pro
35 40 45

His Leu Ala Gln Thr Gly Thr Gln Pro Lys Ala Pro Gly Pro Ala Gln
50 55 60

Arg His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro
65 70 75 80

Arg Ser Leu Asp Glu Asp Ala Pro Ala Glu Gln Arg Thr Gly Val His
85 90 95

Asp Gly Arg Leu Arg Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu
100 105 110

Arg Asp Val Leu Arg Val Gly Pro Glu Gly Phe Trp Pro Arg Arg Leu
115 120 125

Arg Leu Trp Gly Gly Ala Asp His Ala Pro Lys Gly Phe Asp Pro Thr
130 135 140

Val Thr Val Phe His Val Tyr Asp Ile Leu Glu His Val Glu His Ala
145 150 155 160

Tyr Ser Met Arg Ala Ala Gln Leu His Glu Arg Phe Met Asp Ala Ile
165 170 175
Thr Pro Ala Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly
180 185 190

His Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met
195 200 205

Asn Lys Ala Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp
210 215 220

Leu Cys Glu Arg Leu Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser
225 230 235 240

Phe Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg
245 250 255

Ala Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Thr Leu Tyr Tyr Arg Val
260 265 270

Phe Val Arg Ser Gly Arg Ala Leu Ala Tyr Leu Cys Asp Asn Phe Cys
275 280 285

Pro Ala Ile Arg Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe
290 295 300

Ile Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys
305 310 315 320

Pro Gly Arg Gly Asn Ala Pro Ala Gln Pro Arg Pro Pro Thr Ala Phe
325 330 335

Gly Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala
340 345 350

Val Glu Gly Ala Met Cys Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe
355 360 365

Asp Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val
370 375 380

Ala Glu Arg Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr
385 390 395 400

Asp Leu Ser Thr Thr Ala Leu Glu His Ile Leu Leu Phe Ser Leu Gly
405 410 415

Ser Cys Asp Leu Pro Glu Ser His Leu Ser Asp Leu Ala Ser Arg Gly
420 425 430

Leu Pro Ala Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu
435 440 445

Leu Ala Phe Met Thr Phe Val Lys Gln Tyr Gly Pro Glu Phe Val Thr
450 455 460

Gly Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Val Leu Thr Lys Leu
465 470 475 480

Thr Glu Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly
485 490 495

Arg Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys
500 505 510

Arg Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly
515 520 525

Ile Ile Thr Asp Lys Val Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val
530 535 540

Ala Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp
545 550 555 560

Ile Pro Ala Tyr Tyr Ala Ser Gly Pro Ala Gln Arg Gly Val Ile Gly
565 570 575

Glu Tyr Cys Val Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys
580 585 590

Phe Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile
595 600 605

Asn Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr
610 615 620

Cys Leu Leu Arg Leu Ala Gly Gln Lys Gly Phe Ile Leu Pro Asp Thr
625 630 635 640

Gln Gly Arg Phe Arg Gly Leu Asp Lys Glu Ala Pro Lys Arg Pro Ala
645 650 655

Val Pro Arg Gly Glu Gly Glu Arg Pro Gly Asp Gly Asn Gly Asp Glu
660 665 670

Asp Lys Asp Asp Asp Glu Asp Glu Asp Gly Asp Glu Arg Glu Glu Val
675 680 685

Ala Arg Glu Thr Gly Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val
690 695 700

Leu Asp Pro Thr Ser Gly Phe His Val Asp Pro Val Val Val Phe Asp
705 710 715 720

Phe Ala Ser Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe
725 730 735

Ser Thr Leu Ser Leu Arg Pro Glu Ala Val Ala His Leu Glu Ala Asp
740 745 750

Arg Asp Tyr Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val
755 760 765

Lys Ala His Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp
770 775 780

Leu Ala Met Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Thr Pro
785 790 795 800

Glu Glu Ala Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val
805 810 815

Cys Asn Ser Val Tyr Gly Phe Thr Gly Ala Gln His Gly Leu Leu Pro
820 825 830

Cys Leu His Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu
835 840 845

Leu Ala Thr Arg Ala Tyr Val His Ala Arg Trp Ala Glu Phe Asp Gln
850 855 860

Leu Leu Ala Asp Phe Pro Glu Ala Ala Gly Met Arg Ala Pro Gly Pro
865 870 875 880

Tyr Ser Met Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu
885 890 895

Cys Arg Gly Leu Thr Ala Ala Gly Leu Val Ala Met Gly Asp Lys Met
900 905 910

Ala Ser His Ile Ser Arg Ala Leu Phe Leu Pro Pro Ile Lys Leu Glu
915 920 925

Cys Glu Lys Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr
930 935 940

Ile Gly Val Ile Cys Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu
945 950 955 960

Val Arg Lys Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu
965 970 975

Val Asp Leu Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala
980 985 990

Leu Ala Glu Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu
995 1000 1005

Gly Leu Gln Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg
1010 1015 1020

Ile Thr Asp Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala
1025 1030 1035

Glu Leu Ser Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala
1040 1045 1050

His Leu Thr Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val
1055 1060 1065

Pro Ser Ile Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr
1070 1075 1080

Arg Glu Val Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu
1085 1090 1095

Leu Asp Ala Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala
1100 1105 1110

Leu Pro Ser Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser His Ala
1115 1120 1125

Asp Pro Pro Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser
1130 1135 1140

Glu Leu Ala Glu Asp Pro Gly Tyr Ala Ile Ala Arg Gly Val Pro
1145 1150 1155

Leu Asn Thr Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys
1160 1165 1170

Val Thr Phe Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu
1175 1180 1185

Ser Leu Leu Lys Arg Phe Ile Pro Glu Thr Trp His Pro Pro Asp
1190 1195 1200

Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe Gly Pro Ala Gly
1205 1210 1215

Ala Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala
1220 1225 1230

Phe Asp Thr Leu Ala
1235

<210> 3 

<211> 3723 

<212> DNA 

<213> herpes simplex 

<400> 3 



atgttttgtg ccgcgggcgg cccggcttcc cccgggggga agtcggcggc tcgggcggcg 60
tctgggtttt ttgcccccca caacccccgg ggagccaccc agacggcacc gccgccttgc 120
cgccggcaga acttctacaa cccccacctc gctcagaccg gaacgcagcc aaaggccccc 180


y y y ^— ~ ^* 23 ^* ^— ' 
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cgttcgctgg 


acgaggacgc 


ccccgcggag 


cagcgcaccg 


gggtccacga 


cggccgcctc 


300 


cggcgcgccc 


ctaaggtgta 


ctgcgggggg 


gaegagegeg 


acgtcctccg 


cgtgggcccg 


360 


gagggcttct 


ggccgcgtcg 


Gttgcgcctg 


tggggcggtg 


cggaccatgc 


ccccgagggg 


420 


ttcgacccca 


ccgtcaccgt 


cttccacgtg 


tacgacatcc 


tggagcacgt 


ggaacacgcg 


480 


tacagcatgc gcgccgccca gctccacgag cgatttatgg acgccatcac gcccgccggg 540

accgtcatca cgcttctggg tctgaccccc gaaggccatc gcgtcgccgt tcacgtctac 600

ggcacgcggc agtactttta catgaacaag gcggaggtgg atcggcacct gcagtgccgt 660

gccccgcgcg atctctgcga gcgcctggcg gcggccctgc gcgagtcgcc gggggcgtcg 720

ttccgcggca tctccgcgga ccacttcgag gcggaggtgg tggagcgcgc cgacgtgtac 780

tattacgaaa cgcgcccgac cctgtactac cgcgtcttcg tgcgaagcgg gcgcgcgctg 840

gcctacctgt gcgacaactt ttgccccgcg atcaggaagt acgagggggg cgtcgacgcc 900

accacccggt ttatcctgga caacccgggg tttgtcacct tcggctggta ccgcctcaag 960

cccggccgcg ggaacgcgcc ggcccaaccg cgccccccga cggcgttcgg aacctcgagc 1020

gacgtcgagt ttaactgcac ggcggacaac ctggccgtcg agggggccat gtgtgacctg 1080

ccggcctaca agctcatgtg cttcgatatc gaatgcaagg ccggggggga ggacgagctg 1140

gcctttccgg tcgcggaacg cccggaagac ctcgtcatcc agatctcctg tctgctctac 1200

gacctgtcca ccaccgccct cgagcacatc ctcctgtttt cgctcggatc ctgcgacctc 1260

cccgagtccc acctcagcga tctcgcctcc aggggcctgc cggcccccgt cgtcctggag 1320

tttgacagcg aattcgagat gctgctggcc ttcatgacct tcgtcaagca gtacggcccc 1380

gagttcgtga ccgggtacaa catcatcaac ttcgactggc ccttcgtcct gaccaagctg 1440

acggagatct acaaggtccc gctcgacggg tacgggcgca tgaacggccg gggtgtgttc 1500

cgcgtgtggg acatcggcca gagccacttt cagaagcgca gcaagatcaa ggtgaacggg 1560

atggtgaaca tcgacatgta cggcatcatc accgacaagg tcaaactctc cagctacaag 1620

ctgaacgccg tcgccgaggc cgtcttgaag gacaagaaga aggatctgag ctaccgcgac 1680

atccccgcct actacgcctc cgggcccgcg cagcgcgggg tgatcggcga gtattgtgtg 1740

caggactcgc tgctggtcgg gcagctgttc ttcaagtttc tgccgcacct ggagctttcc 1800

gccgtcgcgc gcctggcggg catcaacatc acccgcacca tctacgacgg ccagcagatc 1860

cgcgtcttca cgtgcctcct gcgccttgcg ggccagaagg gcttcatcct gccggacacc 1920

caggggcggt ttcggggcct cgacaaggag gcgcccaagc gcccggccgt gcctcggggg 1980

gaaggggagc ggccggggga cgggaacggg gacgaggata aggacgacga cgaggacggg 2040

gacgaggacg gggacgagcg cgaggaggtc gcgcgcgaga ccgggggccg gcacgttggg 2100

taccaggggg cccgggtcct cgaccccacc tccgggtttc acgtcgaccc cgtggtggtg 2160

tttgactttg ccagcctgta ccccagcatc atccaggccc acaacctgtg cttcagtacg 2220

ctctccctgc ggcccgaggc cgtcgcgcac ctggaggcgg accgggacta cctggagatc 2280

gaggtggggg gccgacggct gttcttcgtg aaggcccacg tacgcgagag cctgctgagc 2340

atcctgctgc gcgactggct ggccatgcga aagcagatcc gctcgcggat cccccagagc 2400

ccccccgagg aggccgtcct cctcgacaag caacaggccg ccatcaaggt ggtgtgcaac 2460

tcggtgtacg ggttcaccgg ggcgcagcac ggtcttctgc cctgcctgca cgtggccgcc 2520

accgtgacga ccatcggccg cgagatgctc ctcgcgacgc gcgcgtacgt gcacgcgcgc 2580

tgggcggagt tcgatcagct gctggccgac tttccggagg cggccggcat gcgcgccccc 2640

ggtccgtact ccatgcgcat catctacggg gacacggact ccattttcgt tttgtgccgc 2700

ggcctcacgg ccgcgggcct ggtggccatg ggcgacaaga tggcgagcca catctcgcgc 2760

acactgttcc tccccccgat caagctcgag tgcgaaaaaa cgttcaccaa gctgctgctc 2820

atcgccaaga aaaagtacat cggcgtcatc tgcgggggca agatgctcat caagggcgtg 2880

gatctggtgc gcaaaaacaa ctgcgcgttt atcaaccgca cctccagggc cctggtcgac 2940

ctgctgtttt acgacgatac cgtatccgga gcggccgccg cgttagccga gcgccccgca 3000

gaggagtggc tggcgcgacc cctgcccgag ggactgcagg cgttcggggc cgtcctcgta 3060

gacgcccatc ggcgcatcac cgacccggag agggacatcc aggactttgt cctcaccgcc 3120 

gaactgagca gacacccgcg cgcgtacacc aacaagcgcc tggcccacct gacggtgtat 3180 

tacaagctca tggcccgccg cgcgcaggtc ccgtccatca aggaccggat cccgtacgtg 3240 

atcgtggccc agacccgcga ggtagaggag acggtcgcgc ggctggccgc cctccgcgag 3300

ctagacgccg ccgccccagg ggacgagccc gcccccccag cggccctgcc ctccccggcc 3360

aagcgccccc gggagacgcc gtcgcatgcc gaccccccgg gaggcgcgtc caagccccgc 3420

aagctgctgg tgtccgagct ggcggaggat cccgggtacg ccatcgcccg gggcgttccg 3480

ctcaacacgg actattactt ctcgcacctg ctgggggcgg cctgcgtgac gttcaaggcc 3540

ctgtttggaa ataacgccaa gatcaccgag agtctgttaa agaggtttat tcccgagacg 3600

tggcaccccc cggacgacgt ggccgcgcgg ctcagggccg cggggttcgg gccggcgggg 3660

gccggcgcta cggcggagga aactcgtcga atgttgcata gagcctttga tactctagca 3720 

tga 3723 

<210> 4 

<211> 1240 

<212> PRT 

<213> herpes simplex 

<400> 4 

Met Phe Cys Ala Ala Gly Gly Pro Ala Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15 

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro His Asn Pro Arg Gly Ala 
20 25 30 

Thr Gln Thr Ala Pro Pro Pro Cys Arg Arg Gln Asn Phe Tyr Asn Pro
35 40 45

His Leu Ala Gln Thr Gly Thr Gln Pro Lys Ala Pro Gly Pro Ala Gln
50 55 60

Arg His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro
65 70 75 80

Arg Ser Leu Asp Glu Asp Ala Pro Ala Glu Gln Arg Thr Gly Val His
85 90 95

Asp Gly Arg Leu Arg Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu
100 105 110

Arg Asp Val Leu Arg Val Gly Pro Glu Gly Phe Trp Pro Arg Arg Leu
115 120 125

Arg Leu Trp Gly Gly Ala Asp His Ala Pro Glu Gly Phe Asp Pro Thr
130 135 140

Val Thr Val Phe His Val Tyr Asp Ile Leu Glu His Val Glu His Ala
145 150 155 160

Tyr Ser Met Arg Ala Ala Gln Leu His Glu Arg Phe Met Asp Ala Ile
165 170 175

Thr Pro Ala Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly
180 185 190

His Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met
195 200 205

Asn Lys Ala Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp
210 215 220

Leu Cys Glu Arg Leu Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser
225 230 235 240

Phe Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg
245 250 255

Ala Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Thr Leu Tyr Tyr Arg Val
260 265 270

Phe Val Arg Ser Gly Arg Ala Leu Ala Tyr Leu Cys Asp Asn Phe Cys
275 280 285

Pro Ala Ile Arg Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe
290 295 300

Ile Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys
305 310 315 320

Pro Gly Arg Gly Asn Ala Pro Ala Gln Pro Arg Pro Pro Thr Ala Phe
325 330 335

Gly Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala
340 345 350

Val Glu Gly Ala Met Cys Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe
355 360 365

Asp Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val
370 375 380

Ala Glu Arg Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr
385 390 395 400

Asp Leu Ser Thr Thr Ala Leu Glu His Ile Leu Leu Phe Ser Leu Gly
405 410 415

Ser Cys Asp Leu Pro Glu Ser His Leu Ser Asp Leu Ala Ser Arg Gly
420 425 430

Leu Pro Ala Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu
435 440 445

Leu Ala Phe Met Thr Phe Val Lys Gln Tyr Gly Pro Glu Phe Val Thr
450 455 460

Gly Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Val Leu Thr Lys Leu
465 470 475 480

Thr Glu Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly
485 490 495 

Arg Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys
500 505 510

Arg Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly
515 520 525

Ile Ile Thr Asp Lys Val Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val
530 535 540

Ala Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp
545 550 555 560

Ile Pro Ala Tyr Tyr Ala Ser Gly Pro Ala Gln Arg Gly Val Ile Gly
565 570 575

Glu Tyr Cys Val Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys
580 585 590

Phe Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile
595 600 605

Asn Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr
610 615 620

Cys Leu Leu Arg Leu Ala Gly Gln Lys Gly Phe Ile Leu Pro Asp Thr
625 630 635 640

Gln Gly Arg Phe Arg Gly Leu Asp Lys Glu Ala Pro Lys Arg Pro Ala
645 650 655

Val Pro Arg Gly Glu Gly Glu Arg Pro Gly Asp Gly Asn Gly Asp Glu
660 665 670

Asp Lys Asp Asp Asp Glu Asp Gly Asp Glu Asp Gly Asp Glu Arg Glu
675 680 685

Glu Val Ala Arg Glu Thr Gly Gly Arg His Val Gly Tyr Gln Gly Ala
690 695 700

Arg Val Leu Asp Pro Thr Ser Gly Phe His Val Asp Pro Val Val Val
705 710 715 720

Phe Asp Phe Ala Ser Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu
725 730 735

Cys Phe Ser Thr Leu Ser Leu Arg Pro Glu Ala Val Ala His Leu Glu
740 745 750

Ala Asp Arg Asp Tyr Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe
755 760 765

Phe Val Lys Ala His Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg
770 775 780

Asp Trp Leu Ala Met Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser
785 790 795 800

Pro Pro Glu Glu Ala Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys
805 810 815

Val Val Cys Asn Ser Val Tyr Gly Phe Thr Gly Ala Gln His Gly Leu
820 825 830 

Leu Pro Cys Leu His Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu
835 840 845

Met Leu Leu Ala Thr Arg Ala Tyr Val His Ala Arg Trp Ala Glu Phe
850 855 860

Asp Gln Leu Leu Ala Asp Phe Pro Glu Ala Ala Gly Met Arg Ala Pro
865 870 875 880

Gly Pro Tyr Ser Met Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe
885 890 895

Val Leu Cys Arg Gly Leu Thr Ala Ala Gly Leu Val Ala Met Gly Asp
900 905 910

Lys Met Ala Ser His Ile Ser Arg Ala Leu Phe Leu Pro Pro Ile Lys
915 920 925

Leu Glu Cys Glu Lys Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys
930 935 940

Lys Tyr Ile Gly Val Ile Cys Gly Gly Lys Met Leu Ile Lys Gly Val
945 950 955 960

Asp Leu Val Arg Lys Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg
965 970 975

Ala Leu Val Asp Leu Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala
980 985 990

Ala Ala Leu Ala Glu Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu
995 1000 1005

Pro Glu Gly Leu Gln Ala Phe Gly Ala Val Leu Val Asp Ala His
1010 1015 1020

Arg Arg Ile Thr Asp Pro Glu Arg Asp Ile Gln Asp Phe Val Leu
1025 1030 1035

Thr Ala Glu Leu Ser Arg His Pro Arg Ala Tyr Thr Asn Lys Arg
1040 1045 1050

Leu Ala His Leu Thr Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala
1055 1060 1065

Gln Val Pro Ser Ile Lys Asp Arg Ile Pro Tyr Val Ile Val Ala
1070 1075 1080

Gln Thr Arg Glu Val Glu Glu Thr Val Ala Arg Leu Ala Ala Leu
1085 1090 1095

Arg Glu Leu Asp Ala Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro
1100 1105 1110

Ala Ala Leu Pro Ser Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser
1115 1120 1125

His Ala Asp Pro Pro Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu
1130 1135 1140

Val Ser Glu Leu Ala Glu Asp Pro Gly Tyr Ala Ile Ala Arg Gly
1145 1150 1155

Val Pro Leu Asn Thr Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala
1160 1165 1170

Ala Cys Val Thr Phe Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile
1175 1180 1185

Thr Glu Ser Leu Leu Lys Arg Phe Ile Pro Glu Thr Trp His Pro
1190 1195 1200 

Pro Asp Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe Gly Pro 
1205 1210 1215 

Ala Gly Ala Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu His 
1220 1225 1230 

Arg Ala Phe Asp Thr Leu Ala 
1235 1240 

<210> 5 
<211> 3708 
<212> DNA 

<213> herpes simplex 
<400> 5 

atgttttccg gtggcggcgg cccgctgtcc cccggaggaa agtcggcggc cagggcggcg 60 

tccgggtttt ttgcgcccgc cggccctcgc ggagccggcc ggggaccccc gccttgtttg 120

aggcaaaact tttacaaccc ctacctcgcc ccagtcggga cgcaacagaa gccgaccggg 180 

ccaacccagc gccatacgta ctatagcgaa tgcgatgaat ttcgattcat cgccccgcgg 240 

gtgctggacg aggatgcccc cccggagaag cgcgccgggg tgcacgacgg tcacctcaag 300

cgcgccccca aggtgtactg cgggggggac gagcgcgacg tcctccgcgt cgggtcgggc 360

ggcttctggc cgcggcgctc gcgcctgtgg ggcggcgtgg accacgcccc ggcggggttc 420 

aaccccaccg tcaccgtctt tcacgtgtac gacatcctgg agaacgtgga gcacgcgtac 480 

ggcatgcgcg cggcccagtt ccacgcgcgg tttatggacg ccatcacacc gacggggacc 540 

gtcatcacgc tcctgggcct gactccggaa ggccaccggg tggccgttca cgtttacggc 600 

acgcggcagt acttttacat gaacaaggag gaggttgaca ggcacctaca atgccgcgcc 660 

ccacgagatc tctgcgagcg catggccgcg gccctgcgcg agtccccggg cgcgtcgttc 720 

cgcggcatct ccgcggacca cttcgaggcg gaggtggtgg agcgcaccga cgtgtactac 780

tacgagacgc gccccgctct gttttaccgc gtctacgtcc gaagcgggcg cgtgctgtcg 840 

tacctgtgcg acaacttctg cccggccatc aagaagtacg agggtggggt cgacgccacc 900

acccggttca tcctggacaa ccccgggttc gtcaccttcg gctggtaccg tctcaaaccg 960 

ggccggaaca acacgctagc ccagccgcgg gccccgatgg ccttcgggac atccagcgac 1020

gtcgagttta actgtacggc ggacaacctg gccatcgagg ggggcatgag cgacctaccg 1080

gcatacaagc tcatgtgctt cgatatcgaa tgcaaggcgg ggggggagga cgagctggcc 1140

tttccggtgg ccgggcaccc ggaggacctg gttattcaga tatcctgtct gctctacgac 1200

ctgtccacca ccgccctgga gcacgtcctc ctgttttcgc tcggttcctg cgacctcccc 1260

gaatcccacc tgaacgagct ggcggccagg ggcctgccca cgcccgtggt tctggaattc 1320

gacagcgaat tcgagatgct gttggccttc atgacccttg tgaaacagta cggccccgag 1380

ttcgtgaccg ggtacaacat catcaacttc gactggccct tcttgctggc caagttgacg 1440

gacatttaca aggtccccct ggacgggtac ggccgcatga acggccgggg cgtgtttcgc 1500

gtgtgggaca taggccagag ccacttccag aagcgcagca agataaaggt gaacggcatg 1560

gtgaacatcg acatgtacgg gatcataacc gacaagatca agctctcgag ctacaagctc 1620

aacgccgtgg ccgaagccgt cctgaaggac aagaagaagg acctgagcta tcgcgacatc 1680

cccgcctact acgccgccgg gcccgcgcaa cgcggggtga tcggcgagta ctgcatacag 1740

gattccctgc tggtgggcca gctgtttttt aagtttttgc cccatctgga gctctcggcc 1800

gtcgcgcgct tggcgggtat taacatcacc cgcaccatct acgacggcca gcagatccgc 1860

gtctttacgt gcctgctgcg cctggccgac cagaagggct ttattctgcc ggacacccag 1920

gggcgattta ggggcgccgg gggggaggcg cccaagcgtc cggccgcagc ccgggaggac 1980

gaggagcggc cagaggagga gggggaggac gaggacgaac gcgaggaggg cgggggcgag 2040

cgggagccgg agggcgcgcg ggagaccgcc ggccggcacg tggggtacca gggggccagg 2100

gtccttgacc ccacttccgg gtttcacgtg aaccccgtgg tggtgttcga ctttgccagc 2160

ctgtacccca gcatcatcca ggcccacaac ctgtgcttca gcacgctctc cctgagggcc 2220

gacgcagtgg cgcacctgga ggcgggcaag gactacctgg agatcgaggt gggggggcga 2280

cggctgttct tcgtcaaggc tcacgtgcga gagagcctcc tcagcatcct cctgcgggac 2340

tggctcgcca tgcgaaagca gatccgctcg cggattcccc agagcagccc cgaggaggcc 2400

gtgctcctgg acaagcagca ggccgccatc aaggtcgtgt gtaactcggt gtacgggttc 2460

acgggagcgc agcacggact cctgccgtgc ctgcacgttg ccgcgacggt gacgaccatc 2520

ggccgcgaga tgctgctcgc gacccgcgag tacgtccacg cgcgctgggc ggccttcgaa 2580

cagctcctgg ccgatttccc ggaggcggcc gacatgcgcg cccccgggcc ctattccatg 2640

cgcatcatct acggggacac ggactccata tttgtgctgt gccgcggcct cacggccgcc 2700

gggctgacgg ccatgggcga caagatggcg agccacatct cgcgcgcgct gtttctgccc 2760

cccatcaaac tcgagtgcga aaagacgttc accaagctgc tgctgatcgc caagaaaaag 2820

tacatcggcg tcatctacgg gggtaagatg ctcatcaagg gcgtggatct ggtgcgcaaa 2880






WO 02/06513 PCT/US01/16525 



aacaactgcg cgtttatcaa ccgcacctcc agggccctgg tcgacctgct gttttacgac 2940

gataccgtat ccggagcggc cgccgcgtta gccgagcgcc ccgcagagga gtggctggcg 3000

cgacccctgc ccgagggact gcaggcgttc ggggccgtcc tcgtagacgc ccatcggcgc 3060

atcaccgacc cggagaggga catccaggac tttgtcctca ccgccgaact gagcagacac 3120

ccgcgcgcgt acaccaacaa gcgcctggcc cacctgacgg tgtattacaa gctcatggcc 3180

cgccgcgcgc aggtcccgtc catcaaggac cggatcccgt acgtgatcgt ggcccagacc 3240

cgcgaggtag aggagacggt cgcgcggctg gccgccctcc gcgagctaga cgccgccgcc 3300

ccaggggacg agcccgcccc ccccgcggcc ctgccctccc cggccaagcg cccccgggag 3360

acgccgtcgc atgccgaccc cccgggaggc gcgtccaagc cccgcaagct gctggtgtcc 3420

gagctggccg aggatcccgc atacgccatt gcccacggcg tcgccctgaa cacggactat 3480

tacttctccc acctgttggg ggcggcgtgc gtgacattca aggccctgtt tgggaataac 3540

gccaagatca ccgagagtct gttaaaaagg tttattcccg aagtgtggca ccccccggac 3600

gacgtggccg cgcggctccg ggccgcaggg ttcggggcgg tgggtgccgg cgctacggcg 3660

gaggaaactc gtcgaatgtt gcatagagcc tttgatactc tagcatga 3708



<210> 6 

<211> 1235 

<212> PRT 

<213> herpes simplex 

<400> 6 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala
20 25 30

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gln Asn Phe Tyr Asn Pro Tyr
35 40 45

Leu Ala Pro Val Gly Thr Gln Gln Lys Pro Thr Gly Pro Thr Gln Arg
50 55 60

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro Arg
65 70 75 80

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp
85 90 95

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg
100 105 110

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg
115 120 125

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val
130 135 140

Thr Val Phe His Val Tyr Asp Ile Leu Glu Asn Val Glu His Ala Tyr
145 150 155 160

Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175

Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190

Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met Asn
195 200 205

Lys Glu Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp Leu
210 215 220

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe
225 230 235 240

Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr
245 250 255

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr
260 265 270

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro
275 280 285

Ala Ile Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe Ile
290 295 300

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro
305 310 315 320

Gly Arg Asn Asn Thr Leu Ala Gln Pro Arg Ala Pro Met Ala Phe Gly
325 330 335

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala Ile
340 345 350

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp
355 360 365

Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala
370 375 380

Gly His Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Asp
385 390 395 400

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser
405 410 415

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu
420 425 430

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu
435 440 445

Ala Phe Met Thr Leu Val Lys Gln Tyr Gly Pro Glu Phe Val Thr Gly
450 455 460

Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr
465 470 475 480 

Asp Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg
485 490 495

Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510

Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525

Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp Ile
545 550 555 560

Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gln Arg Gly Val Ile Gly Glu
565 570 575

Tyr Cys Ile Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys Phe
580 585 590

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile Asn
595 600 605

Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr Cys
610 615 620

Leu Leu Arg Leu Ala Asp Gln Lys Gly Phe Ile Leu Pro Asp Thr Gln
625 630 635 640

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala
645 650 655

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp
660 665 670

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu
675 680 685

Thr Ala Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val Leu Asp Pro
690 695 700

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe Ser Thr Leu
725 730 735

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr
740 745 750

Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765

Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp Leu Ala Met
770 775 780

Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Ser Pro Glu Glu Ala
785 790 795 800

Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser
805 810 815 

Val Tyr Gly Phe Thr Gly Ala Gln His Gly Leu Leu Pro Cys Leu His
820 825 830

Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu Leu Ala Thr
835 840 845

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880

Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu Cys Arg Gly
885 890 895

Leu Thr Ala Ala Gly Leu Thr Ala Met Gly Asp Lys Met Ala Ser His
900 905 910

Ile Ser Arg Ala Leu Phe Leu Pro Pro Ile Lys Leu Glu Cys Glu Lys
915 920 925

Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr Ile Gly Val
930 935 940

Ile Tyr Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu Val Arg Lys
945 950 955 960

Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu Val Asp Leu
965 970 975

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu
980 985 990

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gln
995 1000 1005

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg Ile Thr Asp
1010 1015 1020

Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala Glu Leu Ser
1025 1030 1035

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr
1040 1045 1050

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val Pro Ser Ile
1055 1060 1065

Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr Arg Glu Val
1070 1075 1080

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala
1085 1090 1095

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser
1100 1105 1110

Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser His Ala Asp Pro Pro
1115 1120 1125

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala
1130 1135 1140

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185

Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200 

Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala Gly Ala 
1205 1210 1215 

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr 
1220 1225 1230 

Leu Ala 
1235 

<210> 7 
<211> 3708 
<212> DNA 

<213> herpes simplex 
<400> 7 

atgttttccg gtggcggcgg cccgctgtcc cccggaggaa agtcggcggc cagggcggcg 60 

tccgggtttt ttgcgcccgc cggccctcgc ggagccggcc ggggaccccc gccttgcttg 120

aggcaaaact tttacaaccc ctacctcgcc ccagtcggga cgcaacagaa gccgaccggg 180 

ccaacccagc gccatacgta ctatagcgaa tgcgatgaat ttcgattcat cgccccgcgg 240

gtgctggacg aggatgcccc cccggagaag cgcgccgggg tgcacgacgg tcacctcaag 300 

cgcgccccca aggtgtactg cgggggggac gagcgcgacg tcctccgcgt cgggtcgggc 360

ggcttctggc cgcggcgctc gcgcctgtgg ggcggcgtgg accacgcccc ggcggggttc 420

aaccccaccg tcaccgtctt tcacgtgtac gacatcctgg agaacgtgga gcacgcgtac 480 

ggcatgcgcg cggcccagtt ccacgcgcgg tttatggacg ccatcacacc gacggggacc 540 

gtcatcacgc tcctgggcct gactccggaa ggccaccggg tggccgttca cgtttacggc 600 

acgcggcagt acttttacat gaacaaggag gaggtcgaca ggcacctaca atgccgcgcc 660 

ccacgagatc tctgcgagcg catggccgcg gccctgcgcg agtccccggg cgcgtcgttc 720 

cgcggcattt ccgcggacca cttcgaggcg gaggtggtgg agcgcaccga cgtgtactac 780

tacgagacgc gccccgctct gttttaccgc gtctacgtcc gaagcgggcg cgtgctgtcg 840 

tacctgtgcg acaacttctg cccggccatc aagaagtacg agggtggggt cgacgccacc 900

acccggttca tcctggacaa ccccgggttc gtcaccttcg gctggtaccg tctcaaaccg 960

ggccggaaca acacgctagc ccagccgcgg gccccgatgg ccttcgggac atccagcgac 1020

gtcgagttta actgtacggc ggacaacctg gccatcgagg ggggcatgag cgacctaccg 1080

gcatacaagc tcatgtgctt cgatatcgaa tgcaaggcgg ggggggagga cgagctggcc 1140

tttccggtgg ccgggcaccc ggaggacctg gtcatccaga tatcctgtct gctctacgac 1200

ctgtccacca ccgccctgga gcacgtcctc ctgttttcgc tcggttcctg cgacctcccc 1260

gaatcccacc tgaacgagct ggcggccagg ggcctgccca cgcccgtggt tctggaattc 1320

gacagcgaat tcgagatgct gttggccttc atgacccttg tgaaacagta cggccccgag 1380

ttcgtgaccg ggtacaacat catcaacttc gactggccct tcttgctggc caagctgacg 1440

gacatttaca aggtccccct ggacgggtac ggccgcatga acggccgggg cgtgtttcgc 1500

gtgtgggaca taggccagag ccacttccag aagcgcagca agataaaggt gaacggcatg 1560

gtgaacatcg acatgtacgg gattataacc gacaagatca agctctcgag ctacaagctc 1620

aacgccgtgg ccgaagccgt cctgaaggac aagaagaagg acctgagcta tcgcgacatc 1680

cccgcctact acgccgccgg gcccgcgcaa cgcggggtga tcggcgagta ctgcatacag 1740

gattccctgc tggtgggcca gctgtttttt aagtttttgc cccatctgga gctctcggcc 1800

gtcgcgcgct tggcgggtat taacatcacc cgcaccatct acgacggcca gcagatccgc 1860

gtctttacgt gcctgctgcg cctggccgac cagaagggct ttattctgcc ggacacccag 1920

gggcgattta ggggcggcgg gggggaggcg cccaagcgtc cggccgcagc ccgggaggac 1980

gaggagcggc cagaggagga gggggaggac gaggacgaac gcgaggaggg cgggggcgag 2040

cgggagccgg agggcgcgcg ggagaccgcc ggccggcacg tggggtacca gggggccagg 2100

gtccttgacc ccacttccgg gtttcatgtg aaccccgtgg tggtgttcga ctttgccagc 2160

ctgtacccca gcatcatcca ggcccacaac ctgtgcttca gcacgctctc cctgagggcc 2220

gacgcagtgg cgcacctgga ggcgggcaag gactacctgg agatcgaggt gggggggcga 2280

cggctgttct tcgtcaaggc tcacgtgcga gagagcctcc tcagcatcct cctgcgggac 2340

tggctcgcca tgcgaaagca gatccgctcg cggattcccc agagcagccc cgaggaggcc 2400

gtgctcctgg acaagcagca ggccgccatc aaggtcgtgt gtaactcggt ttacgggttc 2460

acgggagcgc agcacggact cctgccgtgc ctgcacgttg ccgcgacggt gacgaccatc 2520

ggccgcgaga tgctgctcgc gacccgcgag tacgtccacg cgcgctgggc ggccttcgaa 2580

cagctcctgg ccgatttccc ggaggcggcc gacatgcgcg cccccgggcc ctattccatg 2640

cgcatcatct acggggacac ggactccatc tttgtgctgt gccgcggcct cacggccgcc 2700

gggctgacgg ccgtgggcga caagatggcg agccacatct cgcgcgcgct gtttctgtcc 2760

cccatcaaac tcgagtgcga aaagacgttc accaagctgc tgctgatcgc caagaaaaag 2820

tacatcggcg tcatctacgg gggtaagatg ctcatcaagg gcgtggatct ggtgcgcaaa 2880

aacaactgcg cgtttatcaa ccgcacctcc agggccctgg tcgacctgct gttttacgac 2940

gataccgtat ccggagcggc cgccgcgtta gccgagcgcc ccgcagagga gtggctggcg 3000

cgacccctgc ccgagggact gcaggcgttc ggggccgtcc tcgtagacgc ccatcggcgc 3060

atcaccgacc cggagaggga catccaggac tttgtcctca ccgccgaact gagcagacac 3120

ccgcgcgcgt acaccaacaa gcgcctggcc cacctgacgg tgtattacaa gctcatggcc 3180

cgccgcgcgc aggtcccgtc catcaaggac cggatcccgt acgtgatcgt ggcccagacc 3240

cgcgaggtag aggagacggt cgcgcggctg gccgccctcc gcgagctcga cgccgccgcc 3300

ccaggggacg agcccgcccc ccccgcggcc ctgccctccc cggccaagcg cccccgggag 3360

acgccgttgc atgccgaccc cccgggaggc gcgtccaagc cccgcaagct gctggtgtcc 3420

gagctggccg aggatcccgc atacgccatt gcccacggcg tcgccctgaa cacggactat 3480

tacttctccc acctgttggg ggcggcgtgc gtgacattca aggccctgtt tgggaataac 3540

gccaagatca ccgagagtct gttaaaaagg tttattcccg aagtgtggca ccccccggac 3600

gacgtggccg cgcggctccg ggccgcaggg ttcggggcgg tgggtgccgg cgctacggcg 3660

gaggaaactc gtcgaatgtt gcatagagcc tttgatactc tagcatga 3708



<210> 8 
<211> 1235 
<212> PRT 

<213> herpes simplex 
<400> 8 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala 
20 25 30 

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gln Asn Phe Tyr Asn Pro Tyr
35 40 45

Leu Ala Pro Val Gly Thr Gln Gln Lys Pro Thr Gly Pro Thr Gln Arg
50 55 60

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro Arg
65 70 75 80

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp
85 90 95

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg
100 105 110

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg
115 120 125

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val
130 135 140

Thr Val Phe His Val Tyr Asp Ile Leu Glu Asn Val Glu His Ala Tyr
145 150 155 160

Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175

Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190

Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met Asn
195 200 205

Lys Glu Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp Leu
210 215 220

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe
225 230 235 240

Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr
245 250 255

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr
260 265 270

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro
275 280 285

Ala Ile Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe Ile
290 295 300

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro
305 310 315 320

Gly Arg Asn Asn Thr Leu Ala Gln Pro Arg Ala Pro Met Ala Phe Gly
325 330 335

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala Ile
340 345 350

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp
355 360 365

Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala
370 375 380

Gly His Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Asp
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 
405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 
420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu
435 440 445

Ala Phe Met Thr Leu Val Lys Gln Tyr Gly Pro Glu Phe Val Thr Gly
450 455 460

Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr
465 470 475 480

Asp Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg
485 490 495

Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510

Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525

Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp Ile
545 550 555 560

Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gln Arg Gly Val Ile Gly Glu
565 570 575

Tyr Cys Ile Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys Phe
580 585 590

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile Asn
595 600 605

Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr Cys
610 615 620

Leu Leu Arg Leu Ala Asp Gln Lys Gly Phe Ile Leu Pro Asp Thr Gln
625 630 635 640

Gly Arg Phe Arg Gly Gly Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala
645 650 655

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp
660 665 670

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu
675 680 685

Thr Ala Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val Leu Asp Pro
690 695 700

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe Ser Thr Leu
725 730 735

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr
740 745 750

Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765

Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp Leu Ala Met
770 775 780

Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Ser Pro Glu Glu Ala
785 790 795 800 

Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser
805 810 815

Val Tyr Gly Phe Thr Gly Ala Gln His Gly Leu Leu Pro Cys Leu His
820 825 830

Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu Leu Ala Thr
835 840 845

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880

Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu Cys Arg Gly
885 890 895

Leu Thr Ala Ala Gly Leu Thr Ala Val Gly Asp Lys Met Ala Ser His
900 905 910

Ile Ser Arg Ala Leu Phe Leu Ser Pro Ile Lys Leu Glu Cys Glu Lys
915 920 925

Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr Ile Gly Val
930 935 940

Ile Tyr Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu Val Arg Lys
945 950 955 960

Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu Val Asp Leu
965 970 975

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu
980 985 990

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gln
995 1000 1005

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg Ile Thr Asp
1010 1015 1020

Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala Glu Leu Ser
1025 1030 1035

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr
1040 1045 1050

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val Pro Ser Ile
1055 1060 1065

Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr Arg Glu Val
1070 1075 1080 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala 
1085 1090 1095 

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser
1100 1105 1110

Pro Ala Lys Arg Pro Arg Glu Thr Pro Leu His Ala Asp Pro Pro
1115 1120 1125

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala
1130 1135 1140

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185

Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200

Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala Gly Ala
1205 1210 1215

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr
1220 1225 1230

Leu Ala
1235

<210> 9 
<211> 3708 
<212> DNA 

<213> herpes simplex 
<400> 9 

atgttttccg gtggcggcgg cccgctgtcc cccggaggaa agtcggcggc cagggcggcg 60 

tccgggtttt ttgcgcccgc cggccctcgc ggagccggcc ggggaccccc gccttgtttg 120 

aggcaaaact tttacaaccc ctacctcgcc ccagtcggga cgcaacagaa gccgaccggg 180

ccaacccagc gccatacgta ctatagcgaa tgcgatgaat ttcgattcat cgccccgcgg 240

gtgctggacg aggatgcccc cccggagaag cgcgccgggg tgcacgacgg tcacctcaag 300

cgcgccccca aggtgtactg cgggggggac gagcgcgacg tcctccgcgt cgggtcgggc 360

ggcttctggc cgcggcgctc gcgcctgtgg ggcggcgtgg accacgcccc ggcggggttc 420

aaccccaccg tcaccgtctt tcacgtgtat gacatcctgg agaacgtgga gcacgcgtac 480

ggcatgcgcg cggcccagtt ccacgcgcgg tttatggacg ccatcacacc gacggggacc 540

gtcatcacgc tcctgggcct gactccggaa ggccaccggg tggccgttca cgtttacggc 600

acgcggcagt acttttacat gaacaaggag gaggttgaca ggcacctaca atgccgcgcc 660

ccacgagatc tctgcgagcg catggccgcg gccctgcgcg agtccccggg cgcgtcgttc 720

cgcggcatct ccgcggacca cttcgaggcg gaggtggtgg agcgcaccga cgtgtactac 780

tacgagacgc gccccgctct gttttaccgc gtctacgtcc gaagcgggcg cgtgctgtcg 840

tacctgtgcg acaacttctg cccggccatc aagaagtacg agggtggggt cgacgccacc 900

acccggttca tcctggacaa ccccgggttc gtcaccttcg gctggtaccg tctcaaaccg 960

ggccggaaca acacgctagc ccagccgcgg gccccgatgg ccttcgggac atccagcgat 1020

gtcgagttta actgtacggc ggacaacctg gccatcgagg ggggcatgag cgacctaccg 1080

gcatacaagc tcatgtgctt cgatatcgaa tgcaaggcgg ggggggagga cgagctggcc 1140

tttccggtgg ccgggcaccc ggaggacctg gtcatccaga tatcctgtct gctctacgac 1200

ctgtccacca ccgccctgga gcacgtcctc ctgttttcgc tcggttcctg cgacctcccc 1260

gaatcccacc tgaacgagct ggcggccagg ggcctgccca cgcccgtggt tctggaattc 1320

gacagcgaat tcgagatgct gttggccttc atgacccttg tgaaacagta cggccccgag 1380

ttcgtgaccg ggtacaacat aatcaacttc gactggccct tcttgctggc caagctgacg 1440

gacatttaca aggtccccct ggacgggtac ggccgcatga acggccgggg cgtgtttcgc 1500

gtgtgggaca taggccagag ccacttccag aagcgcagca agataaaggt gaacggcatg 1560

gtgaacatcg acatgtacgg gattataacc gacaagatca agctctcgag ctacaagctc 1620

aacgccgtgg ccgaagccgt cctgaaggac aagaagaagg acctgagcta tcgcgacatc 1680

cccacctact acgccgccgg gcccgcgcaa cgcggggtga tcggcgagta ctgcatacag 1740

gattccctgc tggtgggcca gctgtttttt aagtttttgc cccatctgga gctctcggcc 1800

gtcgcgcgct tggcgggtat taacatcacc cgcaccatct acgacggcca gcagatccgc 1860

gtctttacgt gcctgctgcg cctggccgac cagaagggct ttattctgcc ggacacccag 1920

gggcgattta ggggcgccgg gggggaggcg cccaagcgtc cggccgcagc ccgggaggac 1980

gaggagcggc cagaggagga gggggaggac gagaacgaac gcgaggaggg cgggggcgag 2040

cgggagccgg agggcgcgcg ggagaccgcc ggccggcacg tggggtacca gggggccagg 2100

gtccttgacc ccacttccgg gtttcacgtg aaccccgtgg tggtgttcga ctttgccagc 2160

ctgtacccca gcatcatcca ggcccacaac ctgtgcttca gcacgctctc cctgagggcc 2220

gacgcagtgg cgcacctgga ggcgggcaag gactacctgg agatcgaggt gggggggcga 2280

cggctgttct tcgtcaaggc tcacgtgcga gagagcctcc tcagcatcct cctgcgggac 2340

tggctcgcca tgcgaaagca gatccgctcg cggattcccc agagcagccc cgaggaggcc 2400

gtgctcctgg acaagcagca ggccgccatc aaggtcgtgt gtaactcggt ttacgggttc 2460

acgggagcgc agcacggact cctgccgtgc ctgcacgttg ccgcgacggt gacgaccatc 2520

ggccgcgaga tgctgctcgc gacccgcgag tacgtccacg cgcgctgggc ggccttcgaa 2580

cagctcctgg ccgatttccc ggaggcggcc gacatgcgcg cccccgggcc ctattccatg 2640


cgcatcatct acggggacac ggactccata tttgtgctgt gccgcggcct cacggccgcc 2700

gggctgacgg ccgtgggcga caagatggcg agccacatct cgcgcgcgct gtttctgccc 2760

cccatcaaac tcgagtgcga aaagacgttc accaagctgc tgctgatcgc caagaaaaag 2820

tacatcggcg tcatctacgg gggtaagatg ctcatcaagg gcgtggatct ggtgcgcaaa 2880

aacaactgcg cgtttatcaa ccgcacctcc agggccctgg tcgacctgct gttttacgac 2940

gataccgtat ccggagcggc cgccgcgtta gccgagcgcc ccgcagagga gtggctggcg 3000

cgacccctgc ccgagggact gcaggcgttc ggggccgtcc tcgtagacgc ccatcggcgc 3060

atcaccgacc cggagaggga catccaggac tttgttctca ccgccgaact gagcagacac 3120

ccgcgcgcgt acaccaacaa gcgcctggcc cacctgacgg tgtattacaa gctcatggcc 3180

cgccgcgcgc aggtcccgtc catcaaggac cggatcccgt acgtgatcgt ggcccagacc 3240

cgcgaggtag aggagacggt cgcgcggctg gccgccctcc gcgagctaga cgccgccgcc 3300

ccaggggacg agcccgcccc ccccgcggcc ctgccctccc cggccaagcg cccccgggag 3360

acgccgtcgc ctgccgaccc cccgggaggc gcgtccaagc cccgcaagct gctggtgtcc 3420

gagctggccg aggatcccgc atacgccatt gcccacggcg tcgccctgaa cacggactat 3480

tacttctccc acctgttggg ggcggcgtgc gtgacattca aggccctgtt tgggaataac 3540

gccaagatca ccgagagtct gttaaaaagg tttattcccg aagtgtggca ccccccggac 3600

gacgtggccg cgcggctccg gaccgcaggg ttcggggcgg tgggtgccgg cgctacggcg 3660

gaggaaactc gtcgaatgtt gcatagagcc tttgatactc tagcatga 3708



<210> 10 
<211> 1235 
<212> PRT 

<213> herpes simplex 
<400> 10 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala
20 25 30

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gln Asn Phe Tyr Asn Pro Tyr
35 40 45

Leu Ala Pro Val Gly Thr Gln Gln Lys Pro Thr Gly Pro Thr Gln Arg
50 55 60

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro Arg
65 70 75 80

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp
85 90 95

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg
100 105 110

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg
115 120 125

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val
130 135 140

Thr Val Phe His Val Tyr Asp Ile Leu Glu Asn Val Glu His Ala Tyr
145 150 155 160

Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175

Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190

Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met Asn
195 200 205

Lys Glu Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp Leu
210 215 220

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe
225 230 235 240

Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr
245 250 255

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr
260 265 270

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro
275 280 285

Ala Ile Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe Ile
290 295 300

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro
305 310 315 320

Gly Arg Asn Asn Thr Leu Ala Gln Pro Arg Ala Pro Met Ala Phe Gly
325 330 335

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala Ile
340 345 350

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp
355 360 365

Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala
370 375 380

Gly His Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Asp
385 390 395 400

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser
405 410 415

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu
420 425 430

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu
435 440 445

Ala Phe Met Thr Leu Val Lys Gln Tyr Gly Pro Glu Phe Val Thr Gly
450 455 460

Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr
465 470 475 480

Asp Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg
485 490 495

Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510

Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525

Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp Ile
545 550 555 560

Pro Thr Tyr Tyr Ala Ala Gly Pro Ala Gln Arg Gly Val Ile Gly Glu
565 570 575

Tyr Cys Ile Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys Phe
580 585 590

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile Asn
595 600 605

Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr Cys
610 615 620

Leu Leu Arg Leu Ala Asp Gln Lys Gly Phe Ile Leu Pro Asp Thr Gln
625 630 635 640

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala
645 650 655

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asn
660 665 670

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu
675 680 685

Thr Ala Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val Leu Asp Pro
690 695 700

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe Ser Thr Leu
725 730 735

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr
740 745 750

Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765

Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp Leu Ala Met
770 775 780

Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Ser Pro Glu Glu Ala
785 790 795 800

Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser
805 810 815

Val Tyr Gly Phe Thr Gly Ala Gln His Gly Leu Leu Pro Cys Leu His
820 825 830

Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu Leu Ala Thr
835 840 845

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880

Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu Cys Arg Gly
885 890 895

Leu Thr Ala Ala Gly Leu Thr Ala Val Gly Asp Lys Met Ala Ser His
900 905 910

Ile Ser Arg Ala Leu Phe Leu Pro Pro Ile Lys Leu Glu Cys Glu Lys
915 920 925

Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr Ile Gly Val
930 935 940

Ile Tyr Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu Val Arg Lys
945 950 955 960

Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu Val Asp Leu
965 970 975

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu
980 985 990

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gln
995 1000 1005

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg Ile Thr Asp
1010 1015 1020

Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala Glu Leu Ser
1025 1030 1035

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr
1040 1045 1050

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val Pro Ser Ile
1055 1060 1065

Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr Arg Glu Val
1070 1075 1080

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala
1085 1090 1095

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser
1100 1105 1110

Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser Pro Ala Asp Pro Pro
1115 1120 1125

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala
1130 1135 1140

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185

Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200

Ala Arg Leu Arg Thr Ala Gly Phe Gly Ala Val Gly Ala Gly Ala
1205 1210 1215

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr
1220 1225 1230

Leu Ala
1235

<210> 11 
<211> 3729 
<212> DNA 

<213> herpes simplex 
<400> 11 



atgtttttca acccgtatct gagcggcggc gtgaccggcg gtgcggtcgc gggtggccgg 60

cgtcagcgtt cgcagcccgg ctccgcgcag ggctcgggca agcggccgcc acagaaacag 120

tttttgcaga tcgtgccgcg aggtgtcatg ttcgacggtc agacggggtt gatcaagcat 180

aagacgggac ggctgcctct catgttctat cgagagatta aacatttgtt gagtcatgac 240

atggtttggc cgtgtccttg gcgcgagacc ctggtgggtc gcgtggtggg acctattcgt 300

tttcacacct acgatcagac ggacgccgtg ctcttcttcg actcgcccga aaacgtgtcg 360

ccgcgctatc gtcagcatct ggtgccttcg gggaacgtgt tgcgtttctt cggggccaca 420

gaacacggct acagtatctg cgtcaacgtt ttcgggcagc gcagctactt ttactgtgag 480

tacagcgaca ccgataggct gcgtgaggtc attgccagcg tgggcgaact agtgcccgaa 540

ccgcggacgc catacgccgt gtctgtcacg ccggccacca agacctccat ctatgggtac 600

gggacgcgac ccgtgcccga tttgcagtgt gtgtctatca gcaactggac catggccaga 660

aaaatcggcg agtatctgct ggagcagggt tttcccgtgt acgaggtccg tgtggatccg 720

ctgacgcgtt tggtcatcga tcggcggatc accacgttcg gctggtgctc cgtgaatcgt 780

tacgactggc ggcagcaggg tcgcgcgtcg acttgtgata tcgaggtaga ctgcgatgtc 840

tctgacctgg tggctgtgcc cgacgacagc tcgtggccgc gctatcgatg cctgtccttc 900

gatatcgagt gcatgagcgg cgagggtggt tttccctgcg ccgagaagtc cgatgacatt 960

gtcattcaga tctcgtgcgt gtgctacgag acggggggaa acaccgccgt ggatcagggg 1020

atcccaaacg ggaacgatgg tcggggctgc acttcggagg gtgtgatctt tgggcactcg 1080

ggtcttcatc tctttacgat cggcacctgc gggcaggtgg gcccagacgt ggacgtctac 1140

gagttccctt ccgaatacga gctgctgctg ggctttatgc ttttctttca acggtacgcg 1200

ccggcctttg tgaccggtta caacatcaac tcttttgact tgaagtacat cctcacgcgt 1260

ctcgagtacc tgtataaggt ggactcgcag cgcttctgca agttgcctac ggcgcagggc 1320

ggccgtttct ttttacacag ccccgccgtg ggttttaagc ggcagtacgc cgccgctttt 1380

ccctcggctt ctcacaacaa tccggccagc acggccgcca ccaaggtgta tattgcgggt 1440

tcggtggtta tcgacatgta ccctgtatgc atggccaaga ctaactcgcc caactataag 1500

ctcaacacta tggccgagct ttacctgcgg caacgcaagg atgacctgtc ttacaaggac 1560

atcccgcgtt gtttcgtggc taatgccgag ggccgcgccc aggtaggccg ttactgtctg 1620

caggacgccg tattggtgcg cgatctgttc aacaccatta attttcacta cgaggccggg 1680

gccatcgcgc ggctggctaa aattccgttg cggcgtgtca tctttgacgg acagcagatc 1740

cgtatctaca cctcgctgct ggacgagtgc gcctgccgcg attttatcct gcccaaccac 1800

tacagcaaag gtacgacggt gcccgaaacg aatagcgttg ctgtgtcacc taacgctgct 1860

atcatctcta ccgccgctgt gcccggcgac gcgggttctg tggcggctat gtttcagatg 1920

tcgccgccct tgcaatctgc gccgtccagt caggacggcg tttcacccgg ctccggcagt 1980

aacagtagta gcagcgtcgg cgttttcagc gtcggctccg gcagtagtgg cggcgtcggc 2040

gtttccaacg acaatcacgg cgccggcggt actgcggcgg tttcgtacca gggcgccacg 2100

gtgtttgagc ccgaggtggg ttactacaac gaccccgtgg ccgtgttcga ctttgccagc 2160

ctctaccctt ccatcatcat ggcccacaac ctctgctact ccaccctgct ggtgccgggt 2220

ggcgagtacc ctgtggaccc cgccgacgta tacagcgtca cgctagagaa cggcgtgacc 2280

caccgctttg tgcgtgcttc ggtgcgcgtc tcggtgctct cggaactgct caacaagtgg 2340

gtttcgcagc ggcgtgccgt gcgcgaatgc atgcgcgagt gtcaagaccc tgtgcgccgt 2400

atgctgctcg acaaggaaca gatggcgctc aaagtaacgt gcaacgcttt ctacggtttt 2460

accggcgcgc tgaacggtat gatgccgtgt ctgcccatcg ccgccagcat cacgcgcatc 2520


ggtcgcgaca tgctagagcg cacggcgcgg ttcatcaaag acaacttttc agagccgtgt 2580

tttttgcaca atttttttaa tcaggaagac tatgtagtgg gaacgcggga gggggattcg 2640

gaggagagca gcgcgttacc ggaggggctc gaaacatcgt cagggggctc gaacgaacgg 2700

cgggtggagg cgcgggtcat ctacggggac acggacagcg tgtttgtccg ctttcgtggc 2760

ctgacgccgc aggctctggt ggcgcgtggg cccagcctgg cgcactacgt gacggcctgt 2820

ctttttgtgg agcccgtcaa gctggagttt gaaaaggtct tcgtctctct tatgatgatc 2880

tgcaagaaac gttacatcgg caaagtggag ggcgcctcgg gtctgagcat gaagggcgtg 2940

gatctggtgc gcaagacggc ctgcgagttc gtcaagggcg tcacgcgtga cgtcctctcg 3000

ctgctctttg aggatcgcga ggtctcggaa gcagccgtgc gcctgtcgcg cctctcactc 3060

gatgaagtca agaagtacgg cgtgccacgc ggtttctggc gtatcttacg ccgcttggtg 3120

caggcccgcg acgatctgta cctgcaccgt gtgcgtgtcg aggacctggt gctttcgtcg 3180

gtgctctcta aggacatctc gctgtaccgt caatctaacc tgccgcacat tgccgtcatt 3240

aagcgattgg cggcccgttc tgaggagcta ccctcggtcg gggatcgggt cttttacgtt 3300

ctgacggcgc ccggtgtccg gacggcgccg cagggttcct ccgacaacgg tgattctgta 3360

accgccggcg tggtttcccg gtcggacgcg attgatggca cggacgacga cgctgacggc 3420

ggcggggtag aggagagcaa caggagagga ggagagccgg caaagaagag ggcgcggaaa 3480

ccaccgtcgg ccgtgtgcaa ctacgaggta gccgaagatc cgagctacgt gcgcgagcac 3540

ggcgtgccca ttcacgccga caagtacttt gagcaggttc tcaaggctgt aactaacgtg 3600

ctgtcgcccg tctttcccgg cggcgaaacc gcgcgcaagg acaagttttt gcacatggtg 3660

ctgccgcggc gcttgcactt ggagccggct tttctgccgt acagtgtcaa ggcgcacgaa 3720

tgctgttga 3729

<210> 12 
<211> 1242 
<212> PRT 

<213> herpes simplex 
<400> 12 

Met Phe Phe Asn Pro Tyr Leu Ser Gly Gly Val Thr Gly Gly Ala Val
1 5 10 15

Ala Gly Gly Arg Arg Gln Arg Ser Gln Pro Gly Ser Ala Gln Gly Ser
20 25 30

Gly Lys Arg Pro Pro Gln Lys Gln Phe Leu Gln Ile Val Pro Arg Gly
35 40 45

Val Met Phe Asp Gly Gln Thr Gly Leu Ile Lys His Lys Thr Gly Arg
50 55 60

Leu Pro Leu Met Phe Tyr Arg Glu Ile Lys His Leu Leu Ser His Asp
65 70 75 80

Met Val Trp Pro Cys Pro Trp Arg Glu Thr Leu Val Gly Arg Val Val
85 90 95

Gly Pro Ile Arg Phe His Thr Tyr Asp Gln Thr Asp Ala Val Leu Phe
100 105 110

Phe Asp Ser Pro Glu Asn Val Ser Pro Arg Tyr Arg Gln His Leu Val
115 120 125

Pro Ser Gly Asn Val Leu Arg Phe Phe Gly Ala Thr Glu His Gly Tyr
130 135 140

Ser Ile Cys Val Asn Val Phe Gly Gln Arg Ser Tyr Phe Tyr Cys Glu
145 150 155 160

Tyr Ser Asp Thr Asp Arg Leu Arg Glu Val Ile Ala Ser Val Gly Glu
165 170 175

Leu Val Pro Glu Pro Arg Thr Pro Tyr Ala Val Ser Val Thr Pro Ala
180 185 190

Thr Lys Thr Ser Ile Tyr Gly Tyr Gly Thr Arg Pro Val Pro Asp Leu
195 200 205

Gln Cys Val Ser Ile Ser Asn Trp Thr Met Ala Arg Lys Ile Gly Glu
210 215 220

Tyr Leu Leu Glu Gln Gly Phe Pro Val Tyr Glu Val Arg Val Asp Pro
225 230 235 240

Leu Thr Arg Leu Val Ile Asp Arg Arg Ile Thr Thr Phe Gly Trp Cys
245 250 255

Ser Val Asn Arg Tyr Asp Trp Arg Gln Gln Gly Arg Ala Ser Thr Cys
260 265 270

Asp Ile Glu Val Asp Cys Asp Val Ser Asp Leu Val Ala Val Pro Asp
275 280 285

Asp Ser Ser Trp Pro Arg Tyr Arg Cys Leu Ser Phe Asp Ile Glu Cys
290 295 300

Met Ser Gly Glu Gly Gly Phe Pro Cys Ala Glu Lys Ser Asp Asp Ile
305 310 315 320

Val Ile Gln Ile Ser Cys Val Cys Tyr Glu Thr Gly Gly Asn Thr Ala
325 330 335

Val Asp Gln Gly Ile Pro Asn Gly Asn Asp Gly Arg Gly Cys Thr Ser
340 345 350

Glu Gly Val Ile Phe Gly His Ser Gly Leu His Leu Phe Thr Ile Gly
355 360 365

Thr Cys Gly Gln Val Gly Pro Asp Val Asp Val Tyr Glu Phe Pro Ser
370 375 380

Glu Tyr Glu Leu Leu Leu Gly Phe Met Leu Phe Phe Gln Arg Tyr Ala
385 390 395 400

Pro Ala Phe Val Thr Gly Tyr Asn Ile Asn Ser Phe Asp Leu Lys Tyr
405 410 415

Ile Leu Thr Arg Leu Glu Tyr Leu Tyr Lys Val Asp Ser Gln Arg Phe
420 425 430

Cys Lys Leu Pro Thr Ala Gln Gly Gly Arg Phe Phe Leu His Ser Pro
435 440 445

Ala Val Gly Phe Lys Arg Gln Tyr Ala Ala Ala Phe Pro Ser Ala Ser
450 455 460

His Asn Asn Pro Ala Ser Thr Ala Ala Thr Lys Val Tyr Ile Ala Gly
465 470 475 480

Ser Val Val Ile Asp Met Tyr Pro Val Cys Met Ala Lys Thr Asn Ser
485 490 495

Pro Asn Tyr Lys Leu Asn Thr Met Ala Glu Leu Tyr Leu Arg Gln Arg
500 505 510

Lys Asp Asp Leu Ser Tyr Lys Asp Ile Pro Arg Cys Phe Val Ala Asn
515 520 525

Ala Glu Gly Arg Ala Gln Val Gly Arg Tyr Cys Leu Gln Asp Ala Val
530 535 540

Leu Val Arg Asp Leu Phe Asn Thr Ile Asn Phe His Tyr Glu Ala Gly
545 550 555 560

Ala Ile Ala Arg Leu Ala Lys Ile Pro Leu Arg Arg Val Ile Phe Asp
565 570 575

Gly Gln Gln Ile Arg Ile Tyr Thr Ser Leu Leu Asp Glu Cys Ala Cys
580 585 590

Arg Asp Phe Ile Leu Pro Asn His Tyr Ser Lys Gly Thr Thr Val Pro
595 600 605

Glu Thr Asn Ser Val Ala Val Ser Pro Asn Ala Ala Ile Ile Ser Thr
610 615 620

Ala Ala Val Pro Gly Asp Ala Gly Ser Val Ala Ala Met Phe Gln Met
625 630 635 640

Ser Pro Pro Leu Gln Ser Ala Pro Ser Ser Gln Asp Gly Val Ser Pro
645 650 655

Gly Ser Gly Ser Asn Ser Ser Ser Ser Val Gly Val Phe Ser Val Gly
660 665 670

Ser Gly Ser Ser Gly Gly Val Gly Val Ser Asn Asp Asn His Gly Ala
675 680 685

Gly Gly Thr Ala Ala Val Ser Tyr Gln Gly Ala Thr Val Phe Glu Pro
690 695 700

Glu Val Gly Tyr Tyr Asn Asp Pro Val Ala Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Met Ala His Asn Leu Cys Tyr Ser Thr Leu
725 730 735

Leu Val Pro Gly Gly Glu Tyr Pro Val Asp Pro Ala Asp Val Tyr Ser
740 745 750

Val Thr Leu Glu Asn Gly Val Thr His Arg Phe Val Arg Ala Ser Val
755 760 765

Arg Val Ser Val Leu Ser Glu Leu Leu Asn Lys Trp Val Ser Gln Arg
770 775 780

Arg Ala Val Arg Glu Cys Met Arg Glu Cys Gln Asp Pro Val Arg Arg
785 790 795 800

Met Leu Leu Asp Lys Glu Gln Met Ala Leu Lys Val Thr Cys Asn Ala
805 810 815

Phe Tyr Gly Phe Thr Gly Ala Leu Asn Gly Met Met Pro Cys Leu Pro
820 825 830

Ile Ala Ala Ser Ile Thr Arg Ile Gly Arg Asp Met Leu Glu Arg Thr
835 840 845

Ala Arg Phe Ile Lys Asp Asn Phe Ser Glu Pro Cys Phe Leu His Asn
850 855 860

Phe Phe Asn Gln Glu Asp Tyr Val Val Gly Thr Arg Glu Gly Asp Ser
865 870 875 880

Glu Glu Ser Ser Ala Leu Pro Glu Gly Leu Glu Thr Ser Ser Gly Gly
885 890 895

Ser Asn Glu Arg Arg Val Glu Ala Arg Val Ile Tyr Gly Asp Thr Asp
900 905 910

Ser Val Phe Val Arg Phe Arg Gly Leu Thr Pro Gln Ala Leu Val Ala
915 920 925

Arg Gly Pro Ser Leu Ala His Tyr Val Thr Ala Cys Leu Phe Val Glu
930 935 940

Pro Val Lys Leu Glu Phe Glu Lys Val Phe Val Ser Leu Met Met Ile
945 950 955 960

Cys Lys Lys Arg Tyr Ile Gly Lys Val Glu Gly Ala Ser Gly Leu Ser
965 970 975

Met Lys Gly Val Asp Leu Val Arg Lys Thr Ala Cys Glu Phe Val Lys
980 985 990

Gly Val Thr Arg Asp Val Leu Ser Leu Leu Phe Glu Asp Arg Glu Val
995 1000 1005

Ser Glu Ala Ala Val Arg Leu Ser Arg Leu Ser Leu Asp Glu Val
1010 1015 1020

Lys Lys Tyr Gly Val Pro Arg Gly Phe Trp Arg Ile Leu Arg Arg
1025 1030 1035

Leu Val Gln Ala Arg Asp Asp Leu Tyr Leu His Arg Val Arg Val
1040 1045 1050

Glu Asp Leu Val Leu Ser Ser Val Leu Ser Lys Asp Ile Ser Leu
1055 1060 1065



Tyr Arg Gln Ser Asn Leu Pro His Ile Ala Val Ile Lys Arg Leu
1070 1075 1080

Ala Ala Arg Ser Glu Glu Leu Pro Ser Val Gly Asp Arg Val Phe
1085 1090 1095

Tyr Val Leu Thr Ala Pro Gly Val Arg Thr Ala Pro Gln Gly Ser
1100 1105 1110

Ser Asp Asn Gly Asp Ser Val Thr Ala Gly Val Val Ser Arg Ser
1115 1120 1125

Asp Ala Ile Asp Gly Thr Asp Asp Asp Ala Asp Gly Gly Gly Val
1130 1135 1140

Glu Glu Ser Asn Arg Arg Gly Gly Glu Pro Ala Lys Lys Arg Ala
1145 1150 1155

Arg Lys Pro Pro Ser Ala Val Cys Asn Tyr Glu Val Ala Glu Asp
1160 1165 1170

Pro Ser Tyr Val Arg Glu His Gly Val Pro Ile His Ala Asp Lys
1175 1180 1185

Tyr Phe Glu Gln Val Leu Lys Ala Val Thr Asn Val Leu Ser Pro
1190 1195 1200

Val Phe Pro Gly Gly Glu Thr Ala Arg Lys Asp Lys Phe Leu His
1205 1210 1215

Met Val Leu Pro Arg Arg Leu His Leu Glu Pro Ala Phe Leu Pro
1220 1225 1230

Tyr Ser Val Lys Ala His Glu Cys Cys
1235 1240

<210> 13 
<211> 1242 
<212> PRT 

<213> herpes simplex 
<400> 13 

Met Phe Phe Asn Pro Tyr Leu Ser Gly Gly Val Thr Gly Gly Ala Val
1 5 10 15

Ala Gly Gly Arg Arg Gln Arg Ser Gln Pro Gly Ser Ala Gln Gly Ser
20 25 30

Gly Lys Arg Pro Pro Gln Lys Gln Phe Leu Gln Ile Val Pro Arg Gly
35 40 45

Val Met Phe Asp Gly Gln Thr Gly Leu Ile Lys His Lys Thr Gly Arg
50 55 60

Leu Pro Leu Met Phe Tyr Arg Glu Ile Lys His Leu Leu Ser His Asp
65 70 75 80

Met Val Trp Pro Cys Pro Trp Arg Glu Thr Leu Val Gly Arg Val Val
85 90 95

Gly Pro Ile Arg Phe His Thr Tyr Asp Gln Thr Asp Ala Val Leu Phe
100 105 110

Phe Asp Ser Pro Glu Asn Val Ser Pro Arg Tyr Arg Gln His Leu Val
115 120 125

Pro Ser Gly Asn Val Leu Arg Phe Phe Gly Ala Thr Glu His Gly Tyr
130 135 140

Ser Ile Cys Val Asn Val Phe Gly Gln Arg Ser Tyr Phe Tyr Cys Glu
145 150 155 160

Tyr Ser Asp Thr Asp Arg Leu Arg Glu Val Ile Ala Ser Val Gly Glu
165 170 175

Leu Val Pro Glu Pro Arg Thr Pro Tyr Ala Val Ser Val Thr Pro Ala
180 185 190

Thr Lys Thr Ser Ile Tyr Gly Tyr Gly Thr Arg Pro Val Pro Asp Leu
195 200 205

Gln Cys Val Ser Ile Ser Asn Trp Thr Met Ala Arg Lys Ile Gly Glu
210 215 220

Tyr Leu Leu Glu Gln Gly Phe Pro Val Tyr Glu Val Arg Val Asp Pro
225 230 235 240

Leu Thr Arg Leu Val Ile Asp Arg Arg Ile Thr Thr Phe Gly Trp Cys
245 250 255

Ser Val Asn Arg Tyr Asp Trp Arg Gln Gln Gly Arg Ala Ser Thr Cys
260 265 270

Asp Ile Glu Val Asp Cys Asp Val Ser Asp Leu Val Ala Val Pro Asp
275 280 285

Asp Ser Ser Trp Pro Arg Tyr Arg Cys Leu Ser Phe Asp Ile Glu Cys
290 295 300

Met Ser Gly Glu Gly Gly Phe Pro Cys Ala Glu Lys Ser Asp Asp Ile
305 310 315 320

Val Ile Gln Ile Ser Cys Val Cys Tyr Glu Thr Gly Gly Asn Thr Ala
325 330 335

Val Asp Gln Gly Ile Pro Asn Gly Asn Asp Gly Arg Gly Cys Thr Ser
340 345 350

Glu Gly Val Ile Phe Gly His Ser Gly Leu His Leu Phe Thr Ile Gly
355 360 365

Thr Cys Gly Gln Val Gly Pro Asp Val Asp Val Tyr Glu Phe Pro Ser
370 375 380

Glu Tyr Glu Leu Leu Leu Gly Phe Met Leu Phe Phe Gln Arg Tyr Ala
385 390 395 400

Pro Ala Phe Val Thr Gly Tyr Asn Ile Asn Ser Phe Asp Leu Lys Tyr
405 410 415

Ile Leu Thr Arg Leu Glu Tyr Leu Tyr Lys Val Asp Ser Gln Arg Phe
420 425 430

Cys Lys Leu Pro Thr Ala Gln Gly Gly Arg Phe Phe Leu His Ser Pro
435 440 445

Ala Val Gly Phe Lys Arg Gln Tyr Ala Ala Ala Phe Pro Ser Ala Ser
450 455 460

His Asn Asn Pro Ala Ser Thr Ala Ala Thr Lys Val Tyr Ile Ala Gly
465 470 475 480

Ser Val Val Ile Asp Met Tyr Pro Val Cys Met Ala Lys Thr Asn Ser
485 490 495

Pro Asn Tyr Lys Leu Asn Thr Met Ala Glu Leu Tyr Leu Arg Gln Arg
500 505 510

Lys Asp Asp Leu Ser Tyr Lys Asp Ile Pro Arg Cys Phe Val Ala Asn
515 520 525

Ala Glu Gly Arg Ala Gln Val Gly Arg Tyr Cys Leu Gln Asp Ala Val
530 535 540

Leu Val Arg Asp Leu Phe Asn Thr Ile Asn Phe His Tyr Glu Ala Gly
545 550 555 560

Ala Ile Ala Arg Leu Ala Lys Ile Pro Leu Arg Arg Val Ile Phe Asp
565 570 575

Gly Gln Gln Ile Arg Ile Tyr Thr Ser Leu Leu Asp Glu Cys Ala Cys
580 585 590

Arg Asp Phe Ile Leu Pro Asn His Tyr Ser Lys Gly Thr Thr Val Pro
595 600 605

Glu Thr Asn Ser Val Ala Val Ser Pro Asn Ala Ala Ile Ile Ser Thr
610 615 620

Ala Ala Val Pro Gly Asp Ala Gly Ser Val Ala Ala Met Phe Gln Met
625 630 635 640

Ser Pro Pro Leu Gln Ser Ala Pro Ser Ser Gln Asp Gly Val Ser Pro
645 650 655

Gly Ser Gly Ser Asn Ser Ser Ser Ser Val Gly Val Phe Ser Val Gly
660 665 670

Ser Gly Ser Ser Gly Gly Val Gly Val Ser Asn Asp Asn His Gly Ala
675 680 685

Gly Gly Thr Ala Ala Val Ser Tyr Gln Gly Ala Thr Val Phe Glu Pro
690 695 700

Glu Val Gly Tyr Tyr Asn Asp Pro Val Ala Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Met Ala His Asn Leu Cys Tyr Ser Thr Leu
725 730 735

Leu Val Pro Gly Gly Glu Tyr Pro Val Asp Pro Ala Asp Val Tyr Ser
740 745 750

Val Thr Leu Glu Asn Gly Val Thr His Arg Phe Val Arg Ala Ser Val 



39 



WO 02/06513 



PCT/US01/16525 



755 760 765 

Arg Val Ser Val Leu Ser Glu Leu Leu Asn Lys Trp Val Ser Gin Arg 
770 775 780 

Arg Ala Val Arg Glu Cys Met Arg Glu Cys Gin Asp Pro Val Arg Arg 
785 790 795 800 

Met Leu Leu Asp Lys Glu Gin Met Ala Leu Lys Val Thr Cys Asn Ala 
805 810 815 

Phe Tyr Gly Phe Thr Gly Val Val Asn Gly Met Met Pro Cys Leu Pro 
820 825 830 

lie Ala Ala Ser lie Thr Arg lie Gly Arg Asp Met Leu Glu Arg Thr 
835 840 845 

Ala Arg Phe lie Lys Asp Asn Phe Ser Glu Pro Cys Phe Leu His Asn 
850 855 860 

Phe Phe Asn Gin Glu Asp Tyr Val Val Gly Thr Arg Glu Gly Asp Ser 
865 870 875 880 

Glu Glu Ser Ser Ala Leu Pro Glu Gly Leu Glu Thr Ser Ser Gly Gly 
885 890 895 

.Ser Asn Glu Arg Arg Val Glu Ala Arg Val lie Tyr Gly Asp Thr Asp 
900 905 910 

Ser Val Phe Val Arg Phe Arg Gly Leu Thr Pro Gin Ala Leu Val Ala 
915 920 925 

Arg Gly Pro Ser Leu Ala His Tyr Val Thr Ala Cys Leu Phe Val Glu 
930 935 940 

Pro Val Lys Leu Glu Phe Glu Lys Val Phe Val Ser Leu Met Met lie 
945 950 955 960 

Cys Lys Lys Arg Tyr He Gly Lys Val Glu Gly Ala Ser Gly Leu Ser 
965 970 975 

Met Lys Gly Val Asp Leu Val Arg Lys Thr Ala Cys Glu Phe Val Lys 
980 ' 985 990 

Gly Val Thr Arg Asp Val Leu Ser Leu Leu Phe Glu Asp Arg Glu Val 
995 1000 1005 

Ser Glu Ala Ala Val Arg Leu Ser Arg Leu Ser Leu Asp Glu Val 
1010 1015 1020 

Lys Lys Tyr Gly Val Pro Arg Gly Phe Trp Arg He Leu Arg Arg 
1025 1030 1035 

Leu Val Gin Ala Arg Asp Asp Leu Tyr Leu His Arg Val Arg Val 
1040 1045 1050 

Glu Asp Leu Val Leu Ser Ser Val Leu Ser Lys Asp lie Ser Leu 
1055 1060 1065 

Tyr Arg Gin Ser Asn Leu Pro His He Ala Val He Lys Arg Leu 
1070 1075 1080 
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Ala Ala Arg Ser Glu Glu Leu Pro Ser Val Gly Asp Arg Val Phe 
1085 1090 1095 

Tyr Val Leu Thr Ala Pro Gly Val Arg Thr Ala Pro Gin Gly Ser 
1100 1105 1110 

Ser Asp Asn Gly Asp Ser Val Thr Ala Gly Val Val Ser Arg Ser 
1115 1120 1125 

Asp Ala lie Asp Gly Thr Asp Asp Asp Ala Asp Gly Gly Gly Val 
1130 1135 1140 

Glu Glu Ser Asn Arg Arg Gly Gly Glu Pro Ala Lys Lys Arg Ala 
1145 1150 1155 

Arg Lys Pro Pro Ser Ala Val Cys Asn Tyr Glu Val Ala Glu Asp 
1160 1165 1170 

Pro Ser Tyr Val Arg Glu His Gly Val Pro He His Ala Asp Lys 
1175 1180 1185 

Tyr Phe Glu Gin Val Leu Lys Ala Val Thr Asn Val Leu Ser Pro 
1190 1195 1200 

Val Phe Pro Gly Gly Glu Thr Ala Arg Lys Asp Lys Phe Leu His 
1205 1210 1215 

Met Val Leu Pro Arg Arg Leu His Leu Glu Pro Ala Phe Leu Pro 
1220 1225 1230

Tyr Ser Val Lys Ala His Glu Cys Cys 
1235 1240 

<210> 14 
<211> 1238 
<212> PRT 

<213> herpes simplex 
<400> 14 

Met Phe Cys Ala Ala Gly Gly Pro Thr Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro His Asn Pro Arg Gly Ala 
20 25 30 

Thr Gin Thr Ala Pro Pro Pro Cys Arg Arg Gin Asn Phe Tyr Asn Pro 
35 40 45 

His Leu Ala Gin Thr Gly Thr Gin Pro Lys Ala Pro Gly Pro Ala Gin 
50 55 60 

Arg His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe lie Ala Pro 
65 70 75 80 

Arg Ser Leu Asp Glu Asp Ala Pro Ala Glu Gin Arg Thr Gly Val His 
85 90 95 

Asp Gly Arg Leu Arg Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu 
100 105 110 

Arg Asp Val Leu Arg Val Gly Pro Glu Gly Phe Trp Pro Arg Arg Leu
115 120 125

Arg Leu Trp Gly Gly Ala Asp His Ala Pro Lys Gly Phe Asp Pro Thr 
130 135 140 

Val Thr Val Phe His Val Tyr Asp lie Leu Glu His Val Glu His Ala 
145 150 155 160 

Tyr Ser Met Arg Ala Ala Gin Leu His Glu Arg Phe Met Asp Ala lie 
165 170 175 

Thr Pro Ala Gly Thr Val lie Thr Leu Leu Gly Leu Thr Pro Glu Gly 
180 185 190 

His Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met 
195 200 205 

Asn Lys Ala Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp 
210 215 220 

Leu Cys Glu Arg Leu Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser 
225 230 235 240 

Phe Arg Gly lie Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg 
245 250 255 

Ala Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Thr Leu Tyr Tyr Arg Val 
260 265 270 

Phe Val Arg Ser Gly Arg Ala Leu Ala Tyr Leu Cys Asp Asn Phe Cys 
275 280 285 

Pro Ala lie Arg Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe 
290 295 300 

lie Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys 
305 310 315 320 

Pro Gly Arg Gly Asn Ala Pro Ala Gin Pro Arg Pro Pro Thr Ala Phe 
325 330 335 

Gly Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala 
340 345 350 

Val Glu Gly Ala Met Cys Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe 
355 360 365 

Asp lie Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val 
370 375 380 

Ala Glu Arg Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr 
385 390 395 400 

Asp Leu Ser Thr Thr Ala Leu Glu His He Leu Leu Phe Ser Leu Gly 
405 410 415 

Ser Cys Asp Leu Pro Glu Ser His Leu Ser Asp Leu Ala Ser Arg Gly 
420 425 430 

Leu Pro Ala Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu 
435 440 445

Leu Ala Phe Met Thr Phe Val Lys Gln Tyr Gly Pro Glu Phe Val Thr
450 455 460 

Gly Tyr Asn lie lie Asn Phe Asp Trp Pro Phe Val Leu Thr Lys Leu 
465 470 475 480 

Thr Glu He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly 
485 490 495 

Arg Gly Val Phe Arg Val Trp Asp He Gly Gin Ser His Phe Gin Lys 
500 505 510 

Arg Ser Lys He Lys Val Asn Gly Met Val Asn He Asp Met Tyr Gly 
515 520 525 

He He Thr Asp Lys Val Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val 
530 535 540 

Ala Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp 
545 550 555 560 

He Pro Ala Tyr Tyr Ala Ser Gly Pro Ala Gin Arg Gly Val He Gly 
565 570 575 

Glu Tyr Cys Val Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys 
580 585 590 

Phe Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly He 
595 600 605 

Asn He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr 
610 615 620 

Cys Leu Leu Arg Leu Ala Gly Gin Lys Gly Phe He Leu Pro Asp Thr 
625 630 635 640 

Gin Gly Arg Phe Arg Gly Leu Asp Lys Glu Ala Pro Lys Arg Pro Ala 
645 650 655 

Val Pro Arg Gly Glu Gly Glu Arg Pro Gly Asp Gly Asn Gly Asp Glu 
660 665 670 

Asp Lys Asp Asp Asp Glu Asp Glu Asp Gly Asp Glu Arg Glu Glu Val 
675 680 685 

Ala Arg Glu Thr Gly Gly Arg His Val Gly Tyr Gin Gly Ala Arg Val 
690 695 700 

Leu Asp Pro Thr Ser Gly Phe His Val Asp Pro Val Val Val Phe Asp 
705 710 715 720 

Phe Ala Ser Leu Tyr Pro Ser He He Gin Ala His Asn Leu Cys Phe 
725 730 735 

Ser Thr Leu Ser Leu Arg Pro Glu Ala Val Ala His Leu Glu Ala Asp 
740 745 750 

Arg Asp Tyr Leu Glu He Glu Val Gly Gly Arg Arg Leu Phe Phe Val 
755 760 765 

Lys Ala His Val Arg Glu Ser Leu Leu Ser He Leu Leu Arg Asp Trp 
770 775 780

Leu Ala Met Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Thr Pro
785 790 795 800 

Glu Glu Ala Val Leu Leu Asp Lys Gin Gin Ala Ala lie Lys Val Val 
805 810 815 

Cys Asn Ser Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu Leu Pro 
820 825 830 

Cys Leu His Val Ala Ala Thr Val Thr Thr lie Gly Arg Glu Met Leu 
835 840 845 

Leu Ala Thr Arg Ala Tyr Val His Ala Arg Trp Ala Glu Phe Asp Gin 
850 855 860 

Leu Leu Ala Asp Phe Pro Glu Ala Ala Gly Met Arg Ala Pro Gly Pro 
865 870 875 880 

Tyr Ser Met Arg lie lie Tyr Gly Asp Thr Asp Ser lie Phe Val Leu 
885 890 895 

Cys Arg Gly Leu Thr Ala Ala Gly Leu Val Ala Met Gly Asp Lys Met 
900 905 910 

Ala Ser His He Ser Arg Ala Leu Phe Leu Pro Pro He Lys Leu Glu 
915 920 925 

Cys Glu Lys Thr Phe Thr Lys Leu Leu Leu He Ala Lys Lys Lys Tyr 
930 935 940 

lie Gly Val He Cys Gly Gly Lys Met Leu He Lys Gly Val Asp Leu 
945 950 955 960 

Val Arg Lys Asn Asn Cys Ala Phe He Asn Arg Thr Ser Arg Ala Leu 
965 970 975 

Val Asp Leu Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala 
980 985 990 

Leu Ala Glu Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu 
995 1000 1005 

Gly Leu Gin Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg 
1010 1015 1020 

He Thr Asp Pro Glu Arg Asp He Gin Asp Phe Val Leu Thr Ala 
1025 1030 1035 

Glu Leu Ser Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala 
1040 1045 1050 

His Leu Thr Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gin Val 
1055 1060 1065 

Pro Ser He Lys Asp Arg He Pro Tyr Val He Val Ala Gin Thr 
1070 1075 1080 

Arg Glu Val Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu 
1085 1090 1095 

Leu Asp Ala Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala
1100 1105 1110

Leu Pro Ser Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser His Ala 
1115 1120 1125 

Asp Pro Pro Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser 
1130 1135 1140 

Glu Leu Ala Glu Asp Pro Gly Tyr Ala lie Ala Arg Gly Val Pro 
1145 1150 1155 

Leu Asn Thr Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys 
1160 1165 1170 

Val Thr Phe Lys Ala Leu Phe Gly Asn Asn Ala Lys lie Thr Glu 
1175 1180 1185 

Ser Leu Leu Lys Arg Phe lie Pro Glu Thr Trp His Pro Pro Asp 
1190 1195 1200 

Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe Gly Pro Ala Gly 
1205 1210 1215 

Ala Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala 
1220 1225 1230 

Phe Asp Thr Leu Ala 
1235 

<210> 15 
<211> 1240 
<212> PRT 

<213> herpes simplex 
<400> 15 

Met Phe Cys Ala Ala Gly Gly Pro Ala Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro His Asn Pro Arg Gly Ala 
20 25 30 

Thr Gin Thr Ala Pro Pro Pro Cys Arg Arg Gin Asn Phe Tyr Asn Pro 
35 40 45 

His Leu Ala Gin Thr Gly Thr Gin Pro Lys Ala Pro Gly Pro Ala Gin 
50 55 60 

Arg His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe lie Ala Pro 
65 70 75 80 

Arg Ser Leu Asp Glu Asp Ala Pro Ala Glu Gin Arg Thr Gly Val His 
85 90 95 

Asp Gly Arg Leu Arg Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu 
100 105 110 

Arg Asp Val Leu Arg Val Gly Pro Glu Gly Phe Trp Pro Arg Arg Leu 
115 120 125 

Arg Leu Trp Gly Gly Ala Asp His Ala Pro Glu Gly Phe Asp Pro Thr 
130 135 140

Val Thr Val Phe His Val Tyr Asp Ile Leu Glu His Val Glu His Ala
145 150 155 160 

Tyr Ser Met Arg Ala Ala Gin Leu His Glu Arg Phe Met Asp Ala He 
165 170 175 

Thr Pro Ala Gly Thr Val He Thr Leu Leu Gly Leu Thr Pro Glu Gly 
180 185 190 

His Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met 
195 200 205 

Asn Lys Ala Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp 
210 215 220 

Leu Cys Glu Arg Leu Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser 
225 230 235 240 

Phe Arg Gly He Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg 
245 250 255 

Ala Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Thr Leu Tyr Tyr Arg Val 
260 265 270 

Phe Val Arg Ser Gly Arg Ala Leu Ala Tyr Leu Cys Asp Asn Phe Cys 
275 280 285 

Pro Ala He Arg Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe 
290 295 300 

He Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys 
305 310 315 320 

Pro Gly Arg Gly Asn Ala Pro Ala Gin Pro Arg Pro Pro Thr Ala Phe 
325 330 335 

Gly Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala 
340 345 350 

Val Glu Gly Ala Met Cys Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe 
355 360 365 

Asp He Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val 
370 375 380 

Ala Glu Arg Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr 
385 390 395 400 

Asp Leu Ser Thr Thr Ala Leu Glu His He Leu Leu Phe Ser Leu Gly 
405 410 415 

Ser Cys Asp Leu Pro Glu Ser His Leu Ser Asp Leu Ala Ser Arg Gly 
420 425 430 

Leu Pro Ala Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu 
435 440 445 

Leu Ala Phe Met Thr Phe Val Lys Gin Tyr Gly Pro Glu Phe Val Thr 
450 455 460 

Gly Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Val Leu Thr Lys Leu
465 470 475 480

Thr Glu Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly
485 490 495 

Arg Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys
500 505 510 

Arg Ser Lys He Lys Val Asn Gly Met Val Asn lie Asp Met Tyr Gly 
515 520 525 

He He Thr Asp Lys Val Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val 
530 535 540 

Ala Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp 
545 550 555 560 

lie Pro Ala Tyr Tyr Ala Ser Gly Pro Ala Gin Arg Gly Val He Gly 
565 570 575 

Glu Tyr Cys Val Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys 
580 585 590 

Phe Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly He 
595 600 605 

Asn He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr 
610 615 620 

Cys Leu Leu Arg Leu Ala Gly Gin Lys Gly Phe He Leu Pro Asp Thr 
625 630 635 640 

Gin Gly Arg Phe Arg Gly Leu Asp Lys Glu Ala Pro Lys Arg Pro Ala 
645 650 655 

Val Pro Arg Gly Glu Gly Glu Arg Pro Gly Asp Gly Asn Gly Asp Glu 
660 665 670 

Asp Lys Asp Asp Asp Glu Asp Gly Asp Glu Asp Gly Asp Glu Arg Glu 
675 680 685 

Glu Val Ala Arg Glu Thr Gly Gly Arg His Val Gly Tyr Gin Gly Ala 
690 695 700 

Arg Val Leu Asp Pro Thr Ser Gly Phe His Val Asp Pro Val Val Val 
705 710 715 720 

Phe Asp Phe Ala Ser Leu Tyr Pro Ser He He Gin Ala His Asn Leu 
725 730 735 

Cys Phe Ser Thr Leu Ser Leu Arg Pro Glu Ala Val Ala His Leu Glu 
740 745 750 

Ala Asp Arg Asp Tyr Leu Glu He Glu Val Gly Gly Arg Arg Leu Phe 
755 760 765 

Phe Val Lys Ala His Val Arg Glu Ser Leu Leu Ser He Leu Leu Arg 
770 775 780 

Asp Trp Leu Ala Met Arg Lys Gin He Arg Ser Arg lie Pro Gin Ser 
785 790 795 800

Pro Pro Glu Glu Ala Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys
805 810 815 

Val Val Cys Asn Ser Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu 
820 825 830 

Leu Pro Cys Leu His Val Ala Ala Thr Val Thr Thr lie Gly Arg Glu 
835 840 845 

Met Leu Leu Ala Thr Arg Ala Tyr Val His Ala Arg Trp Ala Glu Phe 
850 855 860 

Asp Gin Leu Leu Ala Asp Phe Pro Glu Ala Ala Gly Met Arg Ala Pro 
865 870 875 880 

Gly Pro Tyr Ser Met Arg lie lie Tyr Gly Asp Thr Asp Ser He Phe 
885 890 895 

Val Leu Cys Arg Gly Leu Thr Ala Ala Gly Leu Val Ala Met Gly Asp 
900 905 910 

Lys Met Ala Ser His He Ser Arg Ala Leu Phe Leu Pro Pro He Lys 
915 920 925 

Leu Glu Cys Glu Lys Thr Phe Thr Lys Leu Leu Leu He Ala Lys Lys 
930 935 940 

Lys Tyr He Gly Val He Cys Gly Gly Lys Met Leu He Lys Gly Val 
945 950 955 960 

Asp Leu Val Arg Lys Asn Asn Cys Ala Phe He Asn Arg Thr Ser Arg 
965 970 975 

Ala Leu Val Asp Leu Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala 
980 985 990 

Ala Ala Leu Ala Glu Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu 
995 1000 1005 

Pro Glu Gly Leu Gin Ala Phe Gly Ala Val Leu Val Asp Ala His 
1010 1015 1020 

Arg Arg He Thr Asp Pro Glu Arg Asp He Gin Asp Phe Val Leu 
1025 1030 1035 

Thr Ala Glu Leu Ser Arg His Pro Arg Ala Tyr Thr Asn Lys Arg 
1040 1045 1050 

Leu Ala His Leu Thr Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala 
1055 1060 1065 

Gin Val Pro Ser lie Lys Asp Arg He Pro Tyr Val He Val Ala 
1070 1075 1080 

Gin Thr Arg Glu Val Glu Glu Thr Val Ala Arg Leu Ala Ala Leu 
1085 1090 1095 

Arg Glu Leu Asp Ala Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro 
1100 1105 1110 

Ala Ala Leu Pro Ser Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser 
1115 1120 1125

His Ala Asp Pro Pro Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu
1130 1135 1140 

Val Ser Glu Leu Ala Glu Asp Pro Gly Tyr Ala He Ala Arg Gly 
1145 1150 1155 

Val Pro Leu Asn Thr Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala 
1160 1165 1170 

Ala Cys Val Thr Phe Lys Ala Leu Phe Gly Asn Asn Ala Lys He 
1175 1180 1185 

Thr Glu Ser Leu Leu Lys Arg Phe lie Pro Glu Thr Trp His Pro 
1190 1195 1200 

Pro Asp Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe Gly Pro 
1205 1210 1215

Ala Gly Ala Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu His 
1220 1225 1230 

Arg Ala Phe Asp Thr Leu Ala 
1235 1240 

<210> 16 
<211> 1235 
<212> PRT 

<213> herpes simplex 
<400> 16 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala 
20 25 30 

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gin Asn Phe Tyr Asn Pro Tyr 
35 40 45 

Leu Ala Pro Val Gly Thr Gin Gin Lys Pro Thr Gly Pro Thr Gin Arg 
50 55 60 

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe He Ala Pro Arg 
65 70 75 80 

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp 
85 90 95 

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg
100 105 110 

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg 
115 120 125 

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val 
130 135 140 

Thr Val Phe His Val Tyr Asp lie Leu Glu Asn Val Glu His Ala Tyr 
145 150 155 160

Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175 

Pro Thr Gly Thr Val lie Thr Leu Leu Gly Leu Thr Pro Glu Gly His 
180 185 190 

Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met Asn 
195 200 205 

Lys Glu Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp Leu 
210 215 220 

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe 
225 230 235 240

Arg Gly He Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr 
245 250 255 

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr 
260 265 270 

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro 
275 280 285 

Ala He Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe He 
290 295 300 

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro 
305 310 315 320 

Gly Arg Asn Asn Thr Leu Ala Gin Pro Arg Ala Pro Met Ala Phe Gly 
325 330 335 

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala lie 
340 345 350 

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp 
355 360 365 

lie Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala 
370 375 380 

Gly His Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr Asp 
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 
405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 
420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu 
435 440 445 

Ala Phe Met Thr Leu Val Lys Gin Tyr Gly Pro Glu Phe Val Thr Gly 
450 455 460 

Tyr Asn He He Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr 
465 470 475 480 

Asp He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg 
485 490 495

Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510 

Ser Lys lie Lys Val Asn Gly Met Val Asn He Asp Met Tyr Gly He 
515 520 525 

He Thr Asp Lys He Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala 
530 535 540 

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp He 
545 550 555 560 

Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gin Arg Gly Val He Gly Glu 
565 570 575 

Tyr Cys He Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys Phe 
580 585 590 

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly He Asn 
595 600 605 

He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr Cys 
610 615 620 

Leu Leu Arg Leu Ala Asp Gin Lys Gly Phe He Leu Pro Asp Thr Gin 
625 630 635 640 

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala 
645 650 655 

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp 
660 665 670 

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu 
675 680 685 

Thr Ala Gly Arg His Val Gly Tyr Gin Gly Ala Arg Val Leu Asp Pro 
690 695 700 

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser 
705 710 715 720 

Leu Tyr Pro Ser lie lie Gin Ala His Asn Leu Cys Phe Ser Thr Leu 
725 730 735 

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr 
740 745 750 

Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765 

Val Arg Glu Ser Leu Leu Ser He Leu Leu Arg Asp Trp Leu Ala Met 
770 775 780 

Arg Lys Gin He Arg Ser Arg He Pro Gin Ser Ser Pro Glu Glu Ala 
785 790 795 800 

Val Leu Leu Asp Lys Gin Gin Ala Ala He Lys Val Val Cys Asn Ser 
805 810 815 

Val Tyr Gly Phe Thr Gly Val Gln His Gly Leu Leu Pro Cys Leu His
820 825 830

Val Ala Ala Thr Val Thr Thr lie Gly Arg Glu Met Leu Leu Ala Thr 
835 840 845 

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gin Leu Leu Ala 
850 855 860 

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met 
865 870 875 880 

Arg lie lie Tyr Gly Asp Thr Asp Ser lie Phe Val Leu Cys Arg Gly 
885 890 895 

Leu Thr Ala Ala Gly Leu Thr Ala Met Gly Asp Lys Met Ala Ser His 
900 905 910 

lie Ser Arg Ala Leu Phe Leu Pro Pro lie Lys Leu Glu Cys Glu Lys 
915 920 925 

Thr Phe Thr Lys Leu Leu Leu lie Ala Lys Lys Lys Tyr lie Gly Val 
930 935 940 

lie Tyr Gly Gly Lys Met Leu lie Lys Gly Val Asp Leu Val Arg Lys 
945 950 955 960 

Asn Asn Cys Ala Phe lie Asn Arg Thr Ser Arg Ala Leu Val Asp Leu 
965 970 975 

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu 
980 985 990 

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gin 
995 1000 1005 

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg lie Thr Asp 
1010 1015 1020 

Pro Glu Arg Asp He Gin Asp Phe Val Leu Thr Ala Glu Leu Ser 
1025 1030 1035 

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr 
1040 1045 1050

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gin Val Pro Ser He 
1055 1060 1065 

Lys Asp Arg He Pro Tyr Val lie Val Ala Gin Thr Arg Glu Val 
1070 1075 1080 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala 
1085 1090 1095 

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser 
1100 1105 1110 

Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser His Ala Asp Pro Pro 
1115 1120 1125 

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala 
1130 1135 1140

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155 

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe 
1160 1165 1170 

Lys Ala Leu Phe Gly Asn Asn Ala Lys lie Thr Glu Ser Leu Leu 
1175 1180 1185 

Lys Arg Phe lie Pro Glu Val Trp His Pro Pro Asp Asp Val Ala 
1190 1195 1200 

Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala Gly Ala 
1205 1210 1215 

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr 
1220 1225 1230 

Leu Ala 
1235 

<210> 17 
<211> 1235 
<212> PRT 

<213> herpes simplex 
<400> 17 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala 
20 25 30 

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gin Asn Phe Tyr Asn Pro Tyr 
35 40 45 

Leu Ala Pro Val Gly Thr Gin Gin Lys Pro Thr Gly Pro Thr Gin Arg 
50 55 60

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe lie Ala Pro Arg 
65 70 75 80 

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp 
85 90 95 

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg 
100 105 110 

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg 
115 120 125 

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val 
130 135 140 

Thr Val Phe His Val Tyr Asp lie Leu Glu Asn Val Glu His Ala Tyr 
145 150 155 160 

Gly Met Arg Ala Ala Gin Phe His Ala Arg Phe Met Asp Ala lie Thr 
165 170 175 

Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190

Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met Asn 
195 200 205 

Lys Glu Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp Leu 
210 215 220 

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe 
225 230 235 240 

Arg Gly He Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr 
245 250 255 

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr 
260 265 270 

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro 
275 280 285 

Ala He Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe He 
290 295 300 

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro 
305 310 315 320 

Gly Arg Asn Asn Thr Leu Ala Gin Pro Arg Ala Pro Met Ala Phe Gly 
325 330 335 

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala lie 
340 345 350 

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp 
355 360 365 

lie Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala 
370 375 380 

Gly His Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr Asp 
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 
405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 
420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu 
435 440 445 

Ala Phe Met Thr Leu Val Lys Gin Tyr Gly Pro Glu Phe Val Thr Gly 
450 455 460 

Tyr Asn He He Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr 
465 470 475 480 

Asp He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg 
485 490 495 

Gly Val Phe Arg Val Trp Asp He Gly Gin Ser His Phe Gin Lys Arg 
500 505 510

Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525 

lie Thr Asp Lys lie Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala 
530 535 540 

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp lie 
545 550 555 560 

Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gin Arg Gly Val lie Gly Glu 
565 570 575 

Tyr Cys lie Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys Phe 
580 585 590 

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly lie Asn 
595 600 605 

lie Thr Arg Thr lie Tyr Asp Gly Gin Gin He Arg Val Phe Thr Cys 
610 615 620 

Leu Leu Arg Leu Ala Asp Gin Lys Gly Phe He Leu Pro Asp Thr Gin 
625 630 635 640 

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala 
645 650 655 

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp 
660 665 670 

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu 
675 680 685 

Thr Ala Gly Arg His Val Gly Tyr Gin Gly Ala Arg Val Leu Asp Pro 
690 695 700 

Ile Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720 

Leu Tyr Pro Ser lie He Gin Ala His Asn Leu Cys Phe Ser Thr Leu 
725 730 735 

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr 
740 745 750 

Leu Glu lie Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His 
755 760 765 

Val Arg Glu Ser Leu Leu Ser lie Leu Leu Arg Asp Trp Leu Ala Met 
770 775 780 

Arg Lys Gin He Arg Ser Arg He Pro Gin Ser Ser Pro Glu Glu Ala 
785 790 795 800 

Val Leu Leu Asp Lys Gin Gin Ala Ala He Lys Val Val Cys Asn Ser 
805 810 815 

Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu Leu Pro Cys Leu His 
820 825 830 

Val Ala Ala Thr Val Thr Thr He Gly Arg Glu Met Leu Leu Ala Thr 
835 840 845

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860 

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met 
865 870 875 880 

Arg lie lie Tyr Gly Asp Thr Asp Ser lie Phe Val Leu Cys Arg Gly 
885 890 895 

Leu Thr Ala Ala Gly Leu Thr Ala Met Gly Asp Lys Met Ala Ser His 
900 905 910 

lie Ser Arg Ala Leu Phe Leu Pro Pro lie Lys Leu Glu Cys Glu Lys 
915 920 925 

Thr Phe Thr Lys Leu Leu Leu lie Ala Lys Lys Lys Tyr lie Gly Val 
930 935 940 

lie Tyr Gly Gly Lys Met Leu lie Lys Gly Val Asp Leu Val Arg Lys 
945 950 955 960 

Asn Asn Cys Ala Phe He Asn Arg Thr Ser Arg Ala Leu Val Asp Leu 
965 970 975 

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu 
980 985 990 

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gin 
995 1000 1005 

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg He Thr Asp 
1010 1015 1020 

Pro Glu Arg Asp lie Gin Asp Phe Val Leu Thr Ala Glu Leu Ser 
1025 1030 1035 

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr 
1040 1045 1050 

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gin Val Pro Ser He 
1055 1060 1065 

Lys Asp Arg He Pro Tyr Val He Val Ala Gin Thr Arg Glu Val 
1070 1075 1080 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala 
1085 1090 1095 

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser 
1100 1105 1110 

Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser Pro Ala Asp Pro Pro 
1115 1120 1125 

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala 
1130 1135 1140 

Glu Asp Pro Ala Tyr Ala He Ala His Gly Val Ala Leu Asn Thr 
1145 1150 1155 

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys lie Thr Glu Ser Leu Leu 
1175 1180 1185 

Lys Arg Phe lie Pro Glu Val Trp His Pro Pro Asp Asp Val Thr 
1190 1195 1200 

Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala Gly Ala 
1205 1210 1215 

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr 
1220 1225 1230 

Leu Ala 
1235 

<210> 18 

<211> 1235 

<212> PRT 

<213> herpes simplex 

<400> 18 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15 

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala 
20 25 30 

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gin Asn Phe Tyr Asn Pro Tyr 
35 40 45 

Leu Ala Pro Val Gly Thr Gin Gin Lys Pro Thr Gly Pro Thr Gin Arg 
50 55 60 

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe lie Ala Pro Arg 
65 70 75 80 

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp 
85 90 95 

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg 
100 105 110 

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg 
115 120 125 

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val 
130 135 140 

Thr Val Phe His Val Tyr Asp lie Leu Glu Asn Val Glu His Ala Tyr 
145 150 155 160 

Gly Met Arg Ala Ala Gin Phe His Ala Arg Phe Met Asp Ala lie Thr 
165 170 175 

Pro Thr Gly Thr Val lie Thr Leu Leu Gly Leu Thr Pro Glu Gly His 
180 185 190 

Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met Asn 
195 200 205 






Lys Glu Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp Leu 
210 215 220 

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe 
225 230 235 240 

Arg Gly lie Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr 
245 250 255 

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr 
260 265 270 

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro 
275 280 285 

Ala lie Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe lie 
290 295 300 

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro 
305 310 315 320 

Gly Arg Asn Asn Thr Leu Ala Gin Pro Arg Ala Pro Met Ala Phe Gly 
325 330 335 

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala lie 
340 345 350 

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp 
355 360 365 

lie Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala 
370 375 380 

Gly His Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr Asp 
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 
405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 
420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu 
435 440 445 

Ala Phe Met Thr Leu Val Lys Gin Tyr Gly Pro Glu Phe Val Thr Gly 
450 455 460 

Tyr Asn He He Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr 
465 470 475 480 

Asp He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg 
485 490 495 

Gly Val Phe Arg Val Trp Asp He Gly Gin Ser His Phe Gin Lys Arg 
500 505 510 

Ser Lys He Lys Val Asn Gly Met Val Asn He Asp Met Tyr Gly He 
515 520 525 

Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp lie 
545 550 555 560 

Pro Thr Tyr Tyr Ala Ala Gly Pro Ala Gin Arg Gly Val lie Gly Glu 
565 570 575 

Tyr Cys lie Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys Phe 
580 585 590 

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly lie Asn 
595 600 605 

He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr Cys 
610 615 620 

Leu Leu Arg Leu Ala Asp Gin Lys Gly Phe He Leu Pro Asp Thr Gin 
625 630 635 640 

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala 
645 650 655 

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asn 
660 665 670 

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu 
675 680 685 

Thr Ala Gly Arg His Val Gly Tyr Gin Gly Ala Arg Val Leu Asp Pro 
690 695 700 

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser 
705 710 715 720 

Leu Tyr Pro Ser He He Gin Ala His Asn Leu Cys Phe Ser Thr Leu 
725 730 735 

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr 
740 745 750 

Leu Glu He Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His 
755 760 765 

Val Arg Glu Ser Leu Leu Ser He Leu Leu Arg Asp Trp Leu Ala Met 
770 775 780 

Arg Lys Gin He Arg Ser Arg lie Pro Gin Ser Ser Pro Glu Glu Ala 
785 790 795 800 

Val Leu Leu Asp Lys Gin Gin Ala Ala He Lys Val Val Cys Asn Ser 
805 810 815 

Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu Leu Pro Cys Leu His 
820 825 830 

Val Ala Ala Thr Val Thr Thr He Gly Arg Glu Met Leu Leu Ala Thr 
835 840 845 

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gin Leu Leu Ala 
850 855 860

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880 

Arg He He Tyr Gly Asp Thr Asp Ser He Phe Val Leu Cys Arg Gly 
885 890 895 

Leu Thr Ala Ala Gly Leu Thr Ala Val Gly Asp Lys Met Ala Ser His 
900 905 910 

He Ser Arg Ala Leu Phe Leu Pro Pro He Lys Leu Glu Cys Glu Lys 
915 920 925 

Thr Phe Thr Lys Leu Leu Leu He Ala Lys Lys Lys Tyr He Gly Val 
930 935 940 

He Tyr Gly Gly Lys Met Leu He Lys Gly Val Asp Leu Val Arg Lys 
945 950 955 960 

Asn Asn Cys Ala Phe He Asn Arg Thr Ser Arg Ala Leu Val Asp Leu 
965 970 975 

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu 
980 985 990 

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gin 
995 1000 1005 

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg He Thr Asp 
1010 1015 1020 

Pro Glu Arg Asp He Gin Asp Phe Val Leu Thr Ala Glu Leu Ser 
1025 1030 1035 

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr 
1040 1045 1050 

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gin Val Pro Ser lie 
1055 1060 1065 

Lys Asp Arg He Pro Tyr Val He Val Ala Gin Thr Arg Glu Val 
1070 1075 1080 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala 
1085 1090 1095 

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser 
1100 1105 1110 

Pro Ala Lys Arg Pro Arg Glu Thr Pro Ser Pro Ala Asp Pro Pro 
1115 1120 1125 

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala 
1130 1135 1140 

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185

Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200 

Ala Arg Leu Arg Thr Ala Gly Phe Gly Ala Val Gly Ala Gly Ala 
1205 1210 1215 

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr 
1220 1225 1230 

Leu Ala 
1235 

<210> 19 
<211> 1235 
<212> PRT 

<213> herpes simplex 
<400> 19 

Met Phe Ser Gly Gly Gly Gly Pro Leu Ser Pro Gly Gly Lys Ser Ala 
1 5 10 15

Ala Arg Ala Ala Ser Gly Phe Phe Ala Pro Ala Gly Pro Arg Gly Ala
20 25 30

Gly Arg Gly Pro Pro Pro Cys Leu Arg Gln Asn Phe Tyr Asn Pro Tyr
35 40 45

Leu Ala Pro Val Gly Thr Gln Gln Lys Pro Thr Gly Pro Thr Gln Arg
50 55 60

His Thr Tyr Tyr Ser Glu Cys Asp Glu Phe Arg Phe Ile Ala Pro Arg
65 70 75 80 

Val Leu Asp Glu Asp Ala Pro Pro Glu Lys Arg Ala Gly Val His Asp 
85 90 95 

Gly His Leu Lys Arg Ala Pro Lys Val Tyr Cys Gly Gly Asp Glu Arg 
100 105 110 

Asp Val Leu Arg Val Gly Ser Gly Gly Phe Trp Pro Arg Arg Ser Arg 
115 120 125 

Leu Trp Gly Gly Val Asp His Ala Pro Ala Gly Phe Asn Pro Thr Val 
130 135 140 

Thr Val Phe His Val Tyr Asp Ile Leu Glu Asn Val Glu His Ala Tyr
145 150 155 160

Gly Met Arg Ala Ala Gln Phe His Ala Arg Phe Met Asp Ala Ile Thr
165 170 175

Pro Thr Gly Thr Val Ile Thr Leu Leu Gly Leu Thr Pro Glu Gly His
180 185 190

Arg Val Ala Val His Val Tyr Gly Thr Arg Gln Tyr Phe Tyr Met Asn
195 200 205

Lys Glu Glu Val Asp Arg His Leu Gln Cys Arg Ala Pro Arg Asp Leu
210 215 220

Cys Glu Arg Met Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser Phe
225 230 235 240

Arg Gly Ile Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg Thr
245 250 255 

Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Ala Leu Phe Tyr Arg Val Tyr 
260 265 270 

Val Arg Ser Gly Arg Val Leu Ser Tyr Leu Cys Asp Asn Phe Cys Pro 
275 280 285 

Ala Ile Lys Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe Ile
290 295 300

Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys Pro
305 310 315 320

Gly Arg Asn Asn Thr Leu Ala Gln Pro Arg Ala Pro Met Ala Phe Gly
325 330 335

Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala Ile
340 345 350

Glu Gly Gly Met Ser Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe Asp
355 360 365

Ile Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala
370 375 380

Gly His Pro Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Asp
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 
405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 
420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu 
435 440 445 

Ala Phe Met Thr Leu Val Lys Gln Tyr Gly Pro Glu Phe Val Thr Gly
450 455 460

Tyr Asn Ile Ile Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr
465 470 475 480

Asp Ile Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg
485 490 495

Gly Val Phe Arg Val Trp Asp Ile Gly Gln Ser His Phe Gln Lys Arg
500 505 510

Ser Lys Ile Lys Val Asn Gly Met Val Asn Ile Asp Met Tyr Gly Ile
515 520 525

Ile Thr Asp Lys Ile Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala
530 535 540 

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp Ile
545 550 555 560

Pro Ala Tyr Tyr Ala Ala Gly Pro Ala Gln Arg Gly Val Ile Gly Glu
565 570 575

Tyr Cys Ile Gln Asp Ser Leu Leu Val Gly Gln Leu Phe Phe Lys Phe
580 585 590

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly Ile Asn
595 600 605

Ile Thr Arg Thr Ile Tyr Asp Gly Gln Gln Ile Arg Val Phe Thr Cys
610 615 620

Leu Leu Arg Leu Ala Asp Gln Lys Gly Phe Ile Leu Pro Asp Thr Gln
625 630 635 640

Gly Arg Phe Arg Gly Gly Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala
645 650 655

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp
660 665 670

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu
675 680 685

Thr Ala Gly Arg His Val Gly Tyr Gln Gly Ala Arg Val Leu Asp Pro
690 695 700

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser
705 710 715 720

Leu Tyr Pro Ser Ile Ile Gln Ala His Asn Leu Cys Phe Ser Thr Leu
725 730 735

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr
740 745 750

Leu Glu Ile Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His
755 760 765

Val Arg Glu Ser Leu Leu Ser Ile Leu Leu Arg Asp Trp Leu Ala Met
770 775 780

Arg Lys Gln Ile Arg Ser Arg Ile Pro Gln Ser Ser Pro Glu Glu Ala
785 790 795 800

Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser
805 810 815

Val Tyr Gly Phe Thr Gly Val Gln His Gly Leu Leu Pro Cys Leu His
820 825 830

Val Ala Ala Thr Val Thr Thr Ile Gly Arg Glu Met Leu Leu Ala Thr
835 840 845

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gln Leu Leu Ala
850 855 860

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met
865 870 875 880

Arg Ile Ile Tyr Gly Asp Thr Asp Ser Ile Phe Val Leu Cys Arg Gly
885 890 895

Leu Thr Ala Ala Gly Leu Thr Ala Val Gly Asp Lys Met Ala Ser His 
900 905 910 

Ile Ser Arg Ala Leu Phe Leu Ser Pro Ile Lys Leu Glu Cys Glu Lys
915 920 925

Thr Phe Thr Lys Leu Leu Leu Ile Ala Lys Lys Lys Tyr Ile Gly Val
930 935 940

Ile Tyr Gly Gly Lys Met Leu Ile Lys Gly Val Asp Leu Val Arg Lys
945 950 955 960

Asn Asn Cys Ala Phe Ile Asn Arg Thr Ser Arg Ala Leu Val Asp Leu
965 970 975 

Leu Phe Tyr Asp Asp Thr Val Ser Gly Ala Ala Ala Ala Leu Ala Glu 
980 985 990 

Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu Pro Glu Gly Leu Gln
995 1000 1005

Ala Phe Gly Ala Val Leu Val Asp Ala His Arg Arg Ile Thr Asp
1010 1015 1020

Pro Glu Arg Asp Ile Gln Asp Phe Val Leu Thr Ala Glu Leu Ser
1025 1030 1035

Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala His Leu Thr
1040 1045 1050

Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gln Val Pro Ser Ile
1055 1060 1065

Lys Asp Arg Ile Pro Tyr Val Ile Val Ala Gln Thr Arg Glu Val
1070 1075 1080 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala 
1085 1090 1095 

Ala Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser 
1100 1105 1110 

Pro Ala Lys Arg Pro Arg Glu Thr Pro Leu His Ala Asp Pro Pro 
1115 1120 1125 

Gly Gly Ala Ser Lys Pro Arg Lys Leu Leu Val Ser Glu Leu Ala 
1130 1135 1140 

Glu Asp Pro Ala Tyr Ala Ile Ala His Gly Val Ala Leu Asn Thr
1145 1150 1155

Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala Ala Cys Val Thr Phe
1160 1165 1170

Lys Ala Leu Phe Gly Asn Asn Ala Lys Ile Thr Glu Ser Leu Leu
1175 1180 1185

Lys Arg Phe Ile Pro Glu Val Trp His Pro Pro Asp Asp Val Ala
1190 1195 1200

Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala Gly Ala
1205 1210 1215

Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp Thr 
1220 1225 1230 

Leu Ala
1235

Present claims 23 and 24 relate to a compound defined by reference to a
desirable characteristic or property, namely the change of the wild-type
HSV-1 polymerase at amino acid 823 from valine to alanine in the
presence of said compound.

The claims cover all compounds having this characteristic or property, 
whereas the application provides support within the meaning of Article 6 
PCT and/or disclosure within the meaning of Article 5 PCT for only a very 
limited number of such compounds. In the present case, the claims so lack 
support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the compound by reference to a result to be 
achieved. Again, this lack of clarity in the present case is such as to 
render a meaningful search over the whole of the claimed scope 
impossible. Consequently, the search has been carried out for those parts 
of the claims which appear to be clear, supported and disclosed, namely 
those parts relating to the compounds 1-17 in figure 1. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no International 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure.